Abstract


The  research  program  described  in  this  report  addresses  several  generic  avenues  of 
opportunity  in  the  advancement  of  a  sophisticated  component  technology  base  for 
applications  in  optical  signal  processing,  optical  information  processing,  and  optical 
computing.  As  such,  the  research  program  is  multifaceted  as  well  as  highly 
interdisciplinary,  spanning  activities  from  materials  growth,  processing  and  characterization 
through  device  invention  and  evaluation  to  preliminary  system  level  integration. 


The  primary  program  thrusts  include: 

(1)  Identification  of  the  fundamental  limitations  inherent  in  optical  information 
processing  and  computing  systems  that  derive  from  physical  laws,  and 
discrimination  of  these  limitations  from  those  performance  boundaries  that 
result  from  device  technological  and  materials  parameter  considerations; 

(2)  An  in-depth  analysis  of  optical  signal  processing,  optical  information 
processing,  and  optical  computing,  with  consideration  of  present  levels  of 
accomplishment  and  expected  future  development; 

(3)  The  invention  and  extensive  characterization  of  a  novel  differential 
interferometric  readout  technique  for  high  contrast  ratio  parallel  readout  of 
optical  discs  employed  as  two-dimensional  spatial  light  modulators;


(4)  A  critical  assessment  of  the  potential  for  fabrication  of  spatial  light  modulators 
and  volume  holographic  optical  elements  in  the  III-V  or  II-VI  compound
semiconductor  materials  systems  through  the  use  of  layered,  multiple  quantum
well,  and  superlattice  structures; 

(5)  Analysis  and  implementation  of  real  time  volume  holographic  optical  elements, 
including  a  novel  multilayered  structure  (the  so-called  stratified  volume 
holographic  optical  element,  or  SVHOE)  with  unique,  potentially  programmable 
diffraction  characteristics; 

(6)  The  development  of  advanced  techniques  for  the  utilization  of  multi-dimensional 
dynamically  programmable  interconnections  in  hybrid  optical/electronic 
multiprocessors  based  on  VLSI  and  WSI  technologies; 

(7)  The  invention,  analysis,  and  development  of  a  novel  holographic  recording  and 
readout  technique  that  allows  highly  multiplexed,  weighted  interconnections  to 
be  established  in  real  time  photorefractive  materials,  characterized  by  high
throughput  efficiency  and  very  low  interchannel  crosstalk; 

(8)  The  practical  development  of  photorefractive  volume  holographic  optical 
elements,  including  the  incorporation  of  novel  two-layer  antireflection  coatings 
on  very  high  index  substrates  to  eliminate  the  reduction  in  diffraction  efficiency 
attributed  to  the  presence  of  multiple  reflections,  and  the  invention  of  methods 
for  the  elimination  of  a  catastrophic  collapse  of  the  internal  electric  field  in
photorefractive  materials  under  holographic  recording  conditions; 

(9)  Optimization  of  the  growth  and  processing  of  important  electrooptic  single 
crystal  materials  such  as  bismuth  silicon  oxide,  lithium  niobate,  and  strontium 
barium  niobate; 

(10)  Development  of  novel  materials  characterization  techniques  for  dielectric  single 
crystals  and  thin  films,  such  as  the  electrooptic  measurement  of  the  volume 
resistivity  of  photorefractive  materials,  for  integration  with  the  growth  and 
processing  effort;  and 




(11)  Analysis  and  optimization  of  a  number  of  candidate  electrooptic  spatial  light 
modulator  structures  based  on  bulk  single  crystal  materials,  including  the 
Photorefractive  Incoherent-to-Coherent  Optical  Converter  (PICOC),  a  Pockels
Readout  Optical  Modulator  (PROM)  with  no  dielectric  blocking  layers  and  a
<111>  crystallographic  orientation,  single  channel  and  linear  array  total  internal
reflection  spatial  light  modulators,  and  the  Optically  Modulated  Total  Internal 
Reflection  (OMTIR)  spatial  light  modulator.

PHOTONIC  MATERIALS  AND  DEVICES  FOR  OPTICAL
INFORMATION  PROCESSING 
AND  COMPUTING  APPLICATIONS 


1.  PROGRAM  SUMMARY

The  research  program  described  in  this  report  is  focused  on  a  critical  evaluation  of 
advanced  photonic  materials  and  device  concepts  for  the  implementation  of  optical 
information  processing  and  computing  systems.  The  effort  ranges  from  a  detailed 
investigation  of  the  fundamental  physical  and  technological  limitations  that  impact  the 
potential  computational  gain  (e.g.  increases  in  throughput,  decreases  in  decision  time 
subsequent  to  processing,  or  minimization  of  the  energy  expended  during  computation)  of 
optical  information  processing  and  computing  systems,  through  the  invention  and 
characterization  of  key  enabling  devices  such  as  two-dimensional  spatial  light  modulators 
and  volume  holographic  optical  elements,  to  the  development  of  advanced  techniques  for 
materials  growth,  deposition,  and  processing  that  have  a  critical  impact  on  potential  device 
performance.  As  such,  the  research  program  is  necessarily  interdisciplinary,  involving 
both  faculty  and  students  with  physics,  mathematics,  electrical  engineering,  optics, 
materials  science,  and  chemical  engineering  expertise.  This  multifaceted  evaluation  of 
novel  materials,  device,  and  systems  concepts  has  been  directly  responsible  for  the 
invention  and  characterization  of  a  number  of  photonic  devices  and  materials  processing 
techniques  that  exhibit  both  high  performance  and  capacity  for  practical  manufacturing. 

The  research  period  covered  by  this  Final  Technical  Report  extends  from  15  May, 
1987  through  14  September,  1991.  In  addition  to  the  set  of  publications  attached  as 
Appendices  to  this  report,  several  additional  manuscripts  are  either  in  press,  have  been 
submitted  to  appropriate  technical  journals  as  detailed  in  the  publication  list,  or  are  in
advanced  stages  of  preparation.  Preprints  or  reprints  of  these  manuscripts  will  be
forwarded  under  separate  cover  as  they  become  available.

The  primary  program  thrusts  can  be  organized  into  three  principal  categories:  (1) 
fundamental  and  technological  limitations  of  optical  information  processing  and  computing;
(2)  electrically  and  optically  addressed  spatial  light  modulators;  and  (3)  volume  holographic 
optical  elements.  The  principal  results  of  the  research  program  in  each  category  are 
outlined  below.  Further  technical  details  and  a  guide  to  the  various  publications,  as  well  as 
continuing  directions  of  research,  are  provided  in  Section  2  (Progress  During  the  Contract 
Period). 

A  detailed  study  of  the  fundamental  as  well  as  technological  constraints  that  apply  to 
an  optically  based  information  processing  and  computing  technology  has  been  undertaken. 
The  purpose  of  the  study  is  to  delineate  potential  areas  of  opportunity  that  can  be  addressed 
by  both  optical  and  photonic  techniques,  in  combination  with  electronic  technology  where 
appropriate.  A  major  result  of  the  study  thus  far  has  been  an  evaluation  of  the  tradeoffs 
inherent  in  computation  based  on  both  analog  and  binary  (digital)  representations.  Analog 
processing  is  favorable  from  an  energy  metric  perspective  whenever  the  computational 
complexity  of  the  algorithm  (or  architecture)  is  large  enough  to  overcome  an  inherently 
higher  representation  cost  for  a  given  dynamic  range.  A  related  result  is  that  currently 
available  analog  optical  devices  operate  much  closer  to  the  relevant  quantum  limits  than 
either  currently  available  binary  optical  or  electronic  devices.  These  results  are  in  the 
process  of  being  applied  to  analog  electronic  circuitry  as  well  for  direct  comparison.  In  a 
parallel  effort,  the  fundamental  limits  of  the  photorefractive  grating  recording  sensitivity 
have  been  established.  These  limits  clearly  explain  the  apparent  insensitivity  of  currently 
investigated  photorefractive  grating  recording  materials,  and  imply  several  promising 
opportunities  for  critically  needed  sensitivity  enhancement. 


Three  different  types  of  electrically  and/or  optically  addressed  spatial  light  modulators
(SLMs)  have  been  invented,  developed,  and  characterized  during  the  course  of  the  research 
program  to  date.  These  devices  include  an  optical  disc  spatial  light  modulator,  a  multiple 
quantum  well  (MQW)  asymmetric  Fabry-Perot  spatial  light  modulator,  and  the 
Photorefractive  Incoherent-to-Coherent  Optical  Converter  (PICOC).  The  optical  disc 
spatial  light  modulator  leverages  existing  compact  disc  read-only  memory  (CD-ROM) 
technology  by  incorporating  area  encoding  techniques  and  a  differential  interferometric 
readout  configuration  to  yield  a  high  resolution  SLM  with  high  contrast  (100:1),  in  situ 
memory  capacity,  scrolling  capability,  and  near  term  insertion  prospects.  The  multiple
quantum  well  asymmetric  Fabry-Perot  spatial  light  modulator  offers  potential  hybrid 
integrability  of  III-V  compound  semiconductor  modulation  elements  with  silicon-based 
detection  and  control  circuitry,  and  features  a  novel  inverted  cavity  design  made  possible  by 
the  incorporation  of  InGaAs/GaAs  strained  layer  multiple  quantum  wells  on  a  transparent 
GaAs  substrate.  The  Photorefractive  Incoherent-to-Coherent  Optical  Converter  is  a  two- 
dimensional  spatial  light  modulator  that  exhibits  remarkable  fabrication  simplicity  by 
making  use  of  the  incoherent  (image-bearing)  erasure  of  a  coherently  recorded  grating  in  a 
photorefractive  material  such  as  bismuth  silicon  oxide  (Bi12SiO20).

Associated  with  the  optical  disc  spatial  light  modulator  characterization  effort  has  been 
a  parallel  effort  to  develop  an  appropriate  optical  recording  media  test  facility.  To  this  end, 
we  have  participated  in  the  modification  of  a  commercially  available  optical  media  tester 
(Apex  Systems  OHMT-500)  that  is  capable  of  recording  two-dimensional  image-formatted
patterns  on  a  wide  range  of  optical  disc  media  with  an  extremely  high  degree  of  track-to-
track  accuracy.  The  development  and  acquisition  of  this  major  piece  of  capital  equipment
represents  a  significant  program  achievement,  and  has  allowed  for  preliminary  tests  that 
demonstrate  the  viability  of  optical  disc  based  spatial  light  modulators. 


In  the  area  of  volume  holographic  optical  elements  and  interconnections,  significant 
progress  has  been  achieved  on  three  separate  issues:  (1)  the  development  of  a  viable
approach  to  highly  multiplexed  weighted  interconnections  based  on  holographic  techniques; 
(2)  the  characterization  and  implementation  of  Stratified  Volume  Holographic  Optical 
Elements  (SVHOEs);  and  (3)  the  development  and  characterization  of  photorefractive 
materials  and  devices  for  interconnection  applications.  During  the  research  program,  we 
have  invented  a  novel  approach  for  utilizing  volume  holographic  optical  elements  in 
conjunction  with  arrays  of  individually  coherent  but  mutually  incoherent  sources  and  two- 
dimensional  spatial  light  modulators  to  provide  highly  multiplexed  and  weighted 
interconnections  characterized  by  high  throughput  efficiency,  low  interchannel  crosstalk, 
and  capability  for  simultaneous  (as  opposed  to  sequential)  initial  recording  and  weight 
updates.  We  have  also  identified  and  demonstrated  a  number  of  novel  features  of  SVHOE 
devices  that  have  applications  in  programmable  interconnections,  tunable  frequency 
filtering,  and  wavelength  multiplexing/demultiplexing.  In  addition  to  the  investigation  of 
the  fundamental  limitations  of  photorefractive  materials  as  used  in  interconnection 
applications  described  above,  we  have  also  attempted  to  improve  the  practical 
implementation  of  photorefractive  devices.  This  effort  has  included  the  development  of 
multilayer  antireflection  coatings  that  eliminate  the  source  of  multiple  internal  reflections, 
resulting  in  100%  increases  in  the  saturation  diffraction  efficiency  and  two  beam  coupling 
gain;  the  identification  and  characterization  of  the  dramatic  electric  field  collapse  within
photorefractive  crystals  that  occurs  during  grating  recording,  by  means  of  a  novel 
transverse  electrooptic  imaging  method;  the  development  of  techniques  to  minimize  or 
eliminate  the  electric  field  collapse,  resulting  in  increases  in  diffraction  efficiency  and 
response  time;  the  development  of  a  novel  electrooptic  technique  for  the  measurement  of 
very  high  dark  resistivities  in  electrooptic  crystals  (which  determine  the  grating  storage 
time);  the  analysis  of  the  polarization  properties  of  diffraction  in  optically  active  and 
electrooptic  photorefractive  materials,  including  the  effects  of  self-diffraction  during  grating
recording;  and  the  observation  and  characterization  of  the  effects  of  microscopic  charge
screening  on  sub-hologram  formation  in  photorefractive  materials. 


2.  PROGRESS  DURING  THE  CONTRACT  PERIOD


As  discussed  in  the  Program  Summary,  the  central  theme  of  this  research  program  is 
a  critical  assessment  of  the  prospects  for  the  implementation  of  optical  information 
processing  and  computing  systems,  based  on  a  parallel  assessment  of  advanced  photonic
materials  and  device  concepts.  As  such,  the  principal  program  elements  have  included 
studies  of  the  fundamental  physical  and  technological  constraints  that  impact  current  and 
projected  computational  performance;  invention,  development,  and  characterization  of 
critical  optical  processing  and  computing  components,  such  as  one-  and  two-dimensional 
spatial  light  modulators  and  volume  holographic  optical  elements;  and  the  development  of 
advanced  techniques  for  materials  growth,  deposition,  and  processing  that  have  a  critical 
impact  on  potential  device  performance.  In  this  section,  we  describe  the  most  significant 
results  of  the  research  program  to  date,  provide  a  guide  to  the  various  publications  that 
document  these  results,  and  indicate  continuing  directions  of  research  where  appropriate. 


Fundamental  Physical  and  Technological  Constraints  on  Optical 
Information  Processing  and  Computing 

In  order  to  examine  the  fundamental  physical  and  technological  limitations  to  optical 
information  processing  and  computing,  we  have  considered  any  computational  process  to 
comprise  three  separate  functions:  the  representation  of  information,  the  implementation  of 
computational  complexity,  and  the  detection  of  results.  This  separation  provides  a  key  for 
the  analysis  of  any  proposed  algorithm,  architecture,  and  implementation  scheme  from  the 
perspective  of  a  given  metric,  such  as  the  total  energy  required  to  complete  a  particular 
calculation.  In  calculating  the  total  energy  dissipation  inherent  in  a  computational  process,
it  is  necessary  to  include  the  detection  and  storage  of  the  inputs,  the  implementation  of  the 
computation,  the  costs  of  communication  (interconnection)  within  the  processor,  and  the 
detection  and  communication  of  the  results.  The  energy  metric  is  of  considerable
importance,  as  many  currently  envisioned  computational  systems  (both  optical  and
electronic)  are  highly  power  consumptive,  and  are  therefore  limited  in  performance  by  the 
thermodynamics  of  cooling. 

Our  approach  has  been  to  examine  the  computational  process  for  a  number  of 
important  computational  functions  (e.g.  two-dimensional  Fourier  transformation,  image- 
based  correlations,  synthetic  aperture  radar  image  formation,  and  nonlinear  dynamical 
systems  such  as  neural  networks)  that  have  proven  difficult  to  implement  by  electronic 
means,  without  resort  to  large  scale  systems  that  have  size,  weight,  and  power 
characteristics  that  make  them  inappropriate  for  a  wide  range  of  applications.  We  have 
examined  both  the  fundamental  boundaries  implied  by  quantum  statistics  and
thermodynamics,  as  well  as  the  technological  boundaries  implied  by  the  choice  of  particular
components  within  a  given  implementation  (such  as  spatial  light  modulators  based  on  III-V
compound  semiconductor  multiple  quantum  well  modulators  hybridized  with  silicon
detection  and  control  circuitry,  for  example)  [Appendices  4,  7,  8,  9,  and  10;  Publs.  4,  16;
Book  Chs.  2,  3;  Conf.  Pres.  3,  5,  7,  13,  15,  19,  33,  43,  46,  51,  52,  56]. 

One  of  the  most  striking  results  to  emerge  from  this  study  has  been  a  realization  of  the 
dramatic  difference  in  energy  cost  that  exists  between  the  digital  (binary)  and  analog 
representations  of  a  given  number  at  the  same  probability  of  error  (the  equivalent  of  a  bit 
error  rate)  due  to  quantum  statistical  fluctuations  alone.  The  digital  representation  cost 
scales  as  the  logarithm  of  the  overall  dynamic  range,  while  by  contrast  the  analog 
representation  cost  scales  quadratically  with  increasing  dynamic  range.  This  result  has 
direct  implications  for  computation,  in  the  sense  that  computational  algorithms  based  on 
analog  representations  can  be  more  energy  efficient  at  the  fundamental  limits  only  if  the 
computational  complexity  of  the  process  implied  by  the  analog-based  algorithm  is  sufficient
to  make  up  for  the  increased  representation  cost.  The  implications  of  this  result  also  extend 
to  the  representation  of  information,  for  example,  on  optically  addressed  spatial  light
modulators,  in  which  case  a  clear  tradeoff  exists  among  resolution  (number  of  independent
pixels),  frame  rate,  dynamic  range,  and  sensitivity  for  a  given  probability  of  representation
error.  In  many  cases,  the  stated  performance  characteristics  quoted  for  certain  spatial  light 
modulators  cannot  be  obtained  simultaneously  without  provision  for  an  unacceptably  high 
input  intensity. 
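
The scaling contrast described above can be illustrated numerically. The sketch below is a toy model only, not the analysis reported in the cited publications. It assumes a fixed number of photons per binary digit (the hypothetical parameter photons_per_bit) and an analog cost that grows as the square of the number of distinguishable levels with a hypothetical prefactor (photons_per_level_sq). The quadratic form follows the usual Poisson argument that a signal of N photons fluctuates by about the square root of N.

```python
import math


def digital_cost(levels, photons_per_bit=20.0):
    """Photons needed to represent one of `levels` values in binary (toy model)."""
    bits = math.ceil(math.log2(levels))
    return bits * photons_per_bit


def analog_cost(levels, photons_per_level_sq=20.0):
    """Photons needed to resolve `levels` analog levels (toy model, quadratic)."""
    return photons_per_level_sq * levels ** 2


# Compare the two representations over a range of dynamic ranges.
for levels in (2, 16, 256, 4096):
    d = digital_cost(levels)
    a = analog_cost(levels)
    print(f"{levels:5d} levels: digital {d:8.0f}  analog {a:12.0f}  ratio {a / d:10.1f}")
```

Under these assumptions the ratio of analog to digital cost grows rapidly with dynamic range, which is why an analog algorithm must offer a correspondingly large saving in computational complexity to remain competitive on the energy metric.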

Given  the  difference  in  representation  cost,  we  have  compared  a  number  of 
implementations  of  such  computational  processes  as  the  Fourier  transform  that  are  analog 
or  digital,  and  optical  or  electronic  in  nature,  at  both  the  fundamental  (quantum-limited) 
boundaries,  and  at  the  current  limits  of  available  device  technologies  at  the  device,  circuit, 
and  systems  integration  levels.  The  results  are  quite  surprising.  At  the  fundamental  limit, 
analog  (optical)  and  digital  (electronic)  approaches  can  exhibit  comparable  energy  costs.  At 
the  technological  level,  however,  electronic  devices,  circuits,  and  systems  operate  at 
progressively  larger  factors  away  from  the  fundamental  limits,  whereas  currently  available 
optical  components  are  capable  of  operating  much  closer  to  the  appropriate  fundamental 
constraints.  We  are  in  the  process  of  extending  this  line  of  inquiry  to  include  analog 
electronic  circuits  and  systems,  as  well  as  hybrid  photonic  components  comprising  both 
optical  and  electronic  elements.  [Appendices  4  and  8;  Publ.  4;  Book  Ch.  3;  Conf.  Pres.  3, 
5,  7,  13,  15,  19,  33,  43,  51,  52,  56] 

A  second  major  avenue  of  investigation  has  been  to  examine  the  fundamental  and 
technological  limitations  of  spatial  light  modulation,  in  particular  focusing  on  the 
advantages  and  disadvantages  inherent  in  phase  and/or  amplitude  modulation,  on  methods 
for  optimizing  the  utilization  of  available  oscillator  strength  (particularly  in  multiple 
quantum  well  modulators),  and  on  the  optimization  of  asymmetric  cavities  to  enhance 
device  sensitivity,  contrast  ratio,  and  throughput  [Appendices  4,  7,  8,  9,  and  10;  Publs.  4,
16;  Book  Chs.  2,  3;  Conf.  Pres.  3,  5,  7,  13,  15,  19,  33,  43,  46,  51,  52,  56].  In 
particular,  we  have  examined  in  detail  a  hybrid  spatial  light  modulator  with  InGaAs/GaAs
multiple  quantum  well  modulators  that  utilize  the  quantum  confined  Stark  effect  in  an
asymmetric  Fabry-Perot  cavity  configuration,  in  conjunction  with  silicon  driver  chips  that
contain  appropriate  detection  and  control  circuitry  [Appendices  8  and  10;  Publ.  16;  Book 
Ch.  3;  Conf.  Pres.  46,  51,  56]. 

Both  pure  phase  and  pure  amplitude  modulation  cases  have  been  examined  from  the 
perspective  of  minimizing  signal-dependent  amplitude  modulation  in  the  phase  modulator 
case,  and  of  minimizing  signal-dependent  phase  modulation  in  the  amplitude  modulator 
case.  The  results  of  the  study  indicate  that  reflection  amplitude  modulators  with  contrast 
ratios  in  excess  of  20:1  can  be  achieved  with  a  dynamic  range  of  about  50%  (implying  a  3 
dB  insertion  loss),  with  acceptable  signal-dependent  phase  modulation  and  optical 
bandwidth.  Examination  of  the  sensitivity  of  these  results  to  process-induced  variations 
(such  as  thickness  nonuniformities  characteristic  of  current  state-of-the-art  molecular  beam 
epitaxy  (MBE)  techniques)  revealed  that  such  spatial  light  modulator  designs  must  be 
detuned  from  optimized  (theoretically  achievable)  performance  parameters  in  order  to  obtain
the  requisite  uniformity  across  a  two-dimensional  array.  This  study  is  continuing  in 
conjunction  with  a  device  fabrication  and  characterization  effort  in  collaboration  with  Prof. 
Anupam  Madhukar's  research  group  at  USC. 
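
The relation between contrast ratio and insertion loss quoted above can be explored with the standard resonant reflectance of an asymmetric Fabry-Perot cavity. The following is a minimal sketch, not the design analysis of the cited appendices: it assumes lossless mirrors, operation exactly on resonance, and purely illustrative values for the mirror reflectivities and the single-pass absorption. The names r_front, r_back, and alpha_d are hypothetical.

```python
import math


def afp_reflectance(r_front, r_back, alpha_d):
    """Resonant reflectance of an asymmetric Fabry-Perot cavity.

    r_front, r_back: intensity reflectivities of the two mirrors.
    alpha_d: absorption coefficient times cavity length (single pass).
    """
    rf = math.sqrt(r_front)
    # Back mirror amplitude as seen through a round trip in the absorber.
    rb = math.sqrt(r_back) * math.exp(-alpha_d)
    return ((rf - rb) / (1.0 - rf * rb)) ** 2


def matched_absorption(r_front, r_back):
    """Absorption at which the resonant reflectance vanishes."""
    return 0.5 * math.log(r_back / r_front)


def insertion_loss_db(reflectance_on):
    """Insertion loss in dB for a given on-state reflectance."""
    return -10.0 * math.log10(reflectance_on)


# Illustrative (assumed) mirror reflectivities and absorption states.
R_FRONT, R_BACK = 0.30, 0.95
on = afp_reflectance(R_FRONT, R_BACK, alpha_d=0.10)
# Off state deliberately detuned 10% away from the matching condition.
off = afp_reflectance(R_FRONT, R_BACK, 0.9 * matched_absorption(R_FRONT, R_BACK))
print(f"contrast ratio {on / off:.1f}:1, insertion loss {insertion_loss_db(on):.2f} dB")
```

In this model an on-state reflectance of 50% corresponds to an insertion loss of about 3 dB, consistent with the figure in the text, and detuning the off state away from the matching condition trades contrast ratio for tolerance to parameter variations.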

During  the  contract  period  we  have  also  undertaken  and  completed  a  study  of  the 
fundamental  physical  limitations  of  the  photorefractive  grating  recording  sensitivity,  in
order  to  evaluate  the  prospects  for  application  of  photorefractive  materials  to  multiplexed 
interconnection  applications  [Appendices  4  and  7;  Publ.  4;  Book  Chs.  2,  3;  Conf.  Pres.  3, 
5,  7,  13,  15,  33].  In  this  study,  we  determined  the  quantum-limited  photorefractive 
sensitivity  that  can  be  achieved  assuming  a  single  photoexcited  carrier  for  each  absorbed 
photon,  and  compared  the  resulting  optimum  charge  distribution  with  those  that  arise  from 
sinusoidal  grating  exposure  profiles  in  conjunction  with  realistic  charge  transport  models. 
These  results  clearly  explain  the  fundamental  origins  of  the  observed  relatively  low
quantum  efficiencies  of  photorefractive  recording  processes  in  materials  such  as  bismuth
silicon  oxide  and  barium  titanate,  and  point  out  several  potential  directions  for  enhancement
of  the  sensitivity.  The  importance  of  the  grating  recording  sensitivity  in  optical  processing 
and  computing  systems  applications  is  that  it  establishes  the  maximum  reconfiguration  rate 
of  a  volume  holographic  optical  element  or  interconnection  device  that  can  be  achieved  with 
a  given  (average)  optical  power. 

Electrically  and  Optically  Addressed  Spatial  Light  Modulators 

During  the  course  of  the  research  program,  a  number  of  different  types  of  electrically 
and/or  optically  addressed  spatial  light  modulators  have  been  invented,  developed,  and 
characterized.  These  devices  include  an  optical  disc  spatial  light  modulator,  a  multiple 
quantum  well  (MQW)  asymmetric  Fabry-Perot  spatial  light  modulator,  and  the 
Photorefractive  Incoherent-to-Coherent  Optical  Converter  (PICOC).  The  optical  disc 
spatial  light  modulator  is  electrically  addressed  (though  optically  written),  while  the  latter 
two  devices  are  optically  addressed.  These  three  devices  span  a  considerable  range  of 
spatial  light  modulator  functions  and  performance  characteristics,  as  well  as  potential  for 
inexpensive  manufacturability  and  near-term  insertion. 

Optical  disc  spatial  light  modulators  were  first  proposed  by  us  [Conf.  Publs.  15  and
24]  in  order  to  take  advantage  of  the  significant  materials  and  device  development  leverage 
provided  by  commercially  available  CD-ROM  technology.  In  this  spatial  light  modulator 
approach,  the  format  of  an  optical  disc  is  altered  to  allow  for  the  recording  of  a  two- 
dimensional  image  in,  for  example,  a  1000  x  1000  pixel  array  within  a  1  cm²  area.  Grey
scale  is  provided  by  area  encoding  techniques,  such  that  within  each  "pixel"  containing  100 
binary  bits,  any  number  of  bits  can  be  written  between  0  and  100,  providing  for  a  binary- 
encoded  analog  representation.  On  readout,  each  pixel  is  under-resolved  to  allow  the  grey 
scale  to  be  realized  without  imaging  individual  bits. 
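
The area encoding scheme described above can be sketched in a few lines. This is an illustration under stated assumptions rather than the recording format actually used: the 100 bits of each pixel are assumed to form a square 10 x 10 cell, the order in which bits are filled is arbitrary, and under-resolved readout is modeled as a simple sum over each cell. All function names are hypothetical.

```python
BITS_PER_SIDE = 10  # assumed square cell: 10 x 10 = 100 binary bits per pixel


def encode_pixel(level):
    """Return a 10 x 10 binary cell with `level` bits written (0..100)."""
    n = BITS_PER_SIDE ** 2
    if not 0 <= level <= n:
        raise ValueError("level must be between 0 and 100")
    flat = [1] * level + [0] * (n - level)  # fill order is an assumption
    return [flat[r * BITS_PER_SIDE:(r + 1) * BITS_PER_SIDE]
            for r in range(BITS_PER_SIDE)]


def encode_image(levels):
    """Tile a 2-D list of grey levels (0..100) into one binary bit map."""
    bitmap = []
    for row in levels:
        cells = [encode_pixel(v) for v in row]
        for r in range(BITS_PER_SIDE):
            bitmap.append([bit for cell in cells for bit in cell[r]])
    return bitmap


def read_under_resolved(bitmap):
    """Sum each 10 x 10 cell, mimicking readout that does not resolve bits."""
    n = BITS_PER_SIDE
    out = []
    for top in range(0, len(bitmap), n):
        out.append([
            sum(bitmap[r][c]
                for r in range(top, top + n)
                for c in range(left, left + n))
            for left in range(0, len(bitmap[0]), n)
        ])
    return out


image = [[0, 25], [50, 100]]
bitmap = encode_image(image)
recovered = read_under_resolved(bitmap)
```

Because any number of bits from 0 to 100 can be written, each pixel carries 101 distinguishable grey levels in this model.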


In  order  to  accomplish  such  readout  with  a  contrast  ratio  compatible  with  the  desired
grey  scale,  we  developed  a  novel  differential  interferometric  readout  configuration  to
circumvent  the  inherently  low  contrast  ratios  (about  2:1)  characteristic  of  commercially 
available  CD-ROMs  that  are  optimized  for  digital  data  recording  applications.  In  this 
configuration,  a  shear  plate  is  used  to  create  two  orthogonally  polarized  readout  beams, 
displaced  from  each  other  along  the  track  direction  by  one  half  of  the  inter-bit  spacing.  On 
readout,  two  independent  images  of  the  optical  disc  are  created,  which  recombine  within 
the  shear  plate  (cancelling  the  displacement)  and  interfere  with  each  other  when  passed 
through  an  appropriately  oriented  polarization  analyzer.  A  π  phase  shift  is  introduced
between  the  beams  by  the  shear  plate,  such  that  the  beams  exactly  cancel  everywhere  except 
where  bits  have  been  written.  In  this  manner,  the  background  reflections  from  the  tracks 
and  grooves  are  eliminated,  and  written  bits  appear  as  a  pair  of  bright  dots  against  an 
essentially  black  background.  In  experimental  tests  of  the  technique  using  a  mirror  in  place 
of  the  optical  disc,  we  have  achieved  contrast  ratios  in  excess  of  600:1  as  limited  by  the  AR
coatings  of  the  various  optical  elements.  On  double  sided  ablative  media  without  AR 
coatings,  contrast  ratios  of  5:1  have  been  measured,  whereas  on  single  sided  bump  forming 
media  the  experimentally  achieved  contrast  ratios  exceed  20:1  [Appendix  11;  Publs.  11,  12;
Conf.  Pres.  15,  24,  33,  36,  40,  47,  52,  55]. 
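
The cancellation mechanism of the differential readout can be illustrated with a one-dimensional toy model along the track direction. The sketch below is not a model of the actual optical system: it assumes a uniform background amplitude reflectance, a hypothetical reduced reflectance at a written bit, a sample spacing equal to the beam displacement, and a circular shift at the ends of the track for simplicity.

```python
def track_amplitude(n, bit_positions, background=1.0, bit=0.7):
    """Amplitude reflectance along a track with bits at the given samples."""
    amp = [background] * n
    for p in bit_positions:
        amp[p] = bit
    return amp


def differential_readout(amp, shift):
    """Intensity of the track image minus a displaced copy (pi phase shift)."""
    n = len(amp)
    return [((amp[i] - amp[(i - shift) % n]) / 2.0) ** 2 for i in range(n)]


amp = track_amplitude(16, [5])
direct = [a ** 2 for a in amp]              # conventional intensity readout
diff = differential_readout(amp, shift=1)   # differential readout
bright = [i for i, v in enumerate(diff) if v > 0]

print("direct contrast:", max(direct) / min(direct))
print("bright samples in differential image:", bright)
```

With the assumed reflectance values the conventional readout shows a contrast of roughly 2:1, while the differential image is dark everywhere except for a pair of adjacent bright samples at the written bit, in line with the behavior described in the text. In a real system the achievable contrast is limited by residual reflections and beam imbalance, which this model omits.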

In  order  to  write  two-dimensional  images  directly  onto  an  optical  disc  in  a  format  that 
allows  for  parallel  readout,  the  traditional  sequential  bit  recording  process  must  be 
modified.  In  collaboration  with  Apex  Systems  in  Boulder,  Colorado,  we  have  developed 
an  optical  media  tester  that  is  capable  of  very  high  (sub-micron)  track-to-track  accuracy, 
and  of  recording  on  a  wide  range  of  optical  disc  media  including  ablative,  bump-forming, 
and  magneto-optic.  In  addition,  the  design  is  optimized  to  minimize  the  time  required  to 
record  each  two-dimensional  image.  Using  this  optical  media  tester,  which  was  acquired 
during  the  contract  period,  we  have  successfully  written  a  wide  range  of  both  binary  and 
analog  two-dimensional  images  on  various  optical  disc  media.  The  largest  images  written
to  date  have  512  x  512  pixels  with  101  binary-encoded  grey  scale  levels.  Parallel  readout
of  these  images  using  the  differential  interferometric  readout  configuration  has  been 
accomplished  with  striking  results.  Further  investigations  are  under  way  to  optimize  the 
recording  parameters  utilized,  as  well  as  the  readout  configuration.

One  of  the  most  useful  features  of  the  optical  disc  spatial  light  modulator  is  its 
capability  for  operation  in  a  "scrolling"  readout  mode,  as  is  desirable  for  example  in 
synthetic  aperture  radar  image  formation.  In  the  scrolling  mode,  it  is  necessary  to  read  out 
the  entire  image  for  subsequent  processing  following  the  arrival  of  each  additional  line  of 
data,  with  the  consequent  deletion  from  the  field  of  view  of  the  oldest  line  of  data.
Operating  in  this  readout  mode,  the  optical  disc  is  currently  capable  of  parallel  readout  rates 
exceeding  5  terabits  of  information  per  second.

A  second  type  of  spatial  light  modulator  that  we  have  actively  investigated  during  the 
program  period  is  a  hybrid  SLM  in  which  the  active  modulator  elements  are  based  on 
InGaAs/GaAs  multiple  quantum  wells  in  an  asymmetric  Fabry-Perot  cavity  configuration,
and  the  optical  address  function  is  carried  out  by  detection  and  control  circuitry  integrated 
on  a  silicon  chip  by  standard  VLSI  foundry  processing.  We  have  chosen  the  strained  layer 
InGaAs/GaAs  system  for  epitaxial  growth  on  GaAs  substrates  in  order  to  achieve  substrate 
transparency  at  the  operational  wavelength  of  about  950  nm.  This  substrate  transparency 
can  be  used  to  advantage  by  allowing  for  an  inverted  asymmetric  Fabry-Perot  cavity 
geometry  in  which  the  low  reflectivity  Bragg  mirror  is  grown  by  MBE  techniques  (in  Prof. 
Anupam  Madhukar's  laboratory)  below  the  MQW  structure,  and  the  high  reflectivity  mirror
is  an  externally  deposited  dielectric  multilayer  coating  [Appendix  10;  Publ.  16;  Book  Ch.  3;
Conf.  Publs.  46,  48,  49,  50,  51,  52,  54,  56].  This  configuration  relieves  the  MBE  growth
process  of  including  the  (several  micron  thick)  high  reflectivity  Bragg  mirror  below  the 
MQW  structure,  at  once  minimizing  the  growth  time  and  complexity  as  well  as  the 
accumulation  of  strain  and  lattice  defects.  In  addition,  it  allows  for  face-to-face  contact  by
means  of  flip-chip  bonding  techniques  between  the  GaAs  substrate  containing  the
modulator  elements,  and  the  Si  chip  containing  the  detectors  and  control  electronics.  This 
in  turn  eliminates  the  need  for  vias  through  both  substrates,  as  is  characteristic  of  traditional 
hybridization. 

To  date  we  have  performed  a  thorough  analysis  of  the  potential  performance 
characteristics  that  can  be  expected  from  such  a  hybrid  SLM  based  on  InGaAs/GaAs 
MQWs  (as  well  as  on  AlGaAs/GaAs  MQWs)  in  order  to  determine  both  optimized  mirror 
and  quantum  well  designs,  as  well  as  the  sensitivity  of  such  designs  to  anticipated  process- 
induced  variations.  On  the  basis  of  this  analysis,  we  have  grown  (in  collaboration  with 
Prof.  Madhukar's  research  group)  a  number  of  candidate  modulator  structures,  for  which 
extensive  optical  and  electrical  characterization  is  in  progress  at  the  present  time.  In  every 
case  examined  thus  far,  the  predictions  based  on  our  design  analysis  have  been  borne  out. 
In  parallel  with  this  effort,  we  have  also  designed  and  fabricated  (through  the  Metal-Oxide- 
Semiconductor Implementation Service (MOSIS) operated for DARPA by USC's
Information  Sciences  Institute)  a  number  of  silicon  chips  that  contain  various  types  of 
photodetectors  and  control  circuitry.  These  chips  are  fully  functional,  and  are  in  process  of 
being  characterized  to  evaluate  their  maximum  operational  bandwidth,  the  accuracy  of  the 
control  circuitry  functional  transformation,  and  the  optical  sensitivity  of  the  photodetectors. 
Preliminary  studies  of  the  requisite  flip-chip  bonding  requirements  are  under  way. 

The  final  type  of  spatial  light  modulator  that  has  been  developed  and  characterized 
during  this  research  program  is  the  Photorefractive  Incoherent-to-Coherent  Optical 
Converter  (PICOC),  which  has  the  unusual  feature  of  combining  photorefractive  volume 
holographic grating recording techniques with incoherent-to-coherent conversion capability
within  the  same  device  [Appendix  6;  Book  Ch.  1].  This  feature  at  once  allows  for  simple 
and  inexpensive  device  fabrication,  as  any  of  a  number  of  bulk  photorefractive  crystals  can 
be used to configure this device without the need for additional device processing. In
addition, it allows for a number of unique optical information processing and computing
applications  that  involve  both  spatial  light  modulation  and  volume  holographic  recording 
techniques.  During  the  contract  period,  an  intensive  study  of  the  factors  that  affect  PICOC 
operational  mode,  readout  configuration,  sensitivity,  resolution,  and  contrast  ratio  was 
completed.  The  results  of  this  study  are  described  in  detail  in  Appendix  6. 

Volume  Holographic  Optical  Elements 

One  of  the  most  important  potential  advantages  of  optical  techniques  for  application  to 
information  processing  and  computing  systems  is  the  capacity  for  parallel,  densely  packed 
crosstalk-free  interconnections  among  multiple  processing  planes  provided  by  volume 
holographic  recording  and  reconstruction  techniques.  Should  real  time  implementations  of 
such  optical  interconnections  not  prove  viable,  this  advantage  may  prove  to  be  minimal,  if  it 
exists  at  all.  As  such,  a  critical  feature  of  this  research  program  has  been  the  evaluation  of 
the  prospects  for  high  performance  volume  holographic  interconnections,  as  well  as  the 
development of practical photorefractive materials and devices.

During  the  contract  period,  significant  progress  has  been  achieved  in  the  development 
of  a  novel  approach  to  highly  multiplexed  weighted  interconnections  based  on  holographic 
techniques,  on  the  characterization  and  implementation  of  Stratified  Volume  Holographic 
Optical  Elements  (SVHOEs),  and  on  the  development  and  characterization  of 
photorefractive materials and devices for interconnection applications. Progress in each of
these  three  areas  is  summarized  below. 

We have recently discovered that the traditional methods for recording volume
holographic  interconnections  lead  to  either  high  interchannel  crosstalk  in  the  case  of  fully 
coherent,  simultaneous  recording  schemes,  or  to  interchannel  crosstalk,  high  throughput 
losses,  and  complex  recording  schedules  in  the  case  of  sequential  (coherent  or  incoherent) 
recording  schemes.  By  utilizing  the  optical  beam  propagation  method,  we  have  simulated  a 
number of such point-to-point weighted interconnections with multiple holographically
recorded gratings, and have been able to quantify the degree of crosstalk and throughput
loss in each case [Appendix 8; Publs. 9 and 10; Book Ch. 3; Patent 1; Conf. Pres. 37, 38,
39, 41, 42, 45, 49, 50, 52, 53, 54, 57]. In addition, we have identified a new form of
crosstalk,  so-called  beam  degeneracy  crosstalk,  that  produces  substantial  interchannel 
crosstalk  even  in  the  incoherent  recording  case,  and  accounts  for  a  dramatic  throughput  loss 
as  well.  For  an  N  input  point  to  N  output  point  holographic  interconnection  recorded  by 
traditional  sequential  methods,  for  example,  the  optical  throughput  efficiency  is  reduced  by 
of  order  1/N.  This  loss  is  potentially  catastrophic  for  large  scale  interconnection  networks 
such  as  those  envisioned  for  optical  implementations  of  neural  networks. 

Subsequent  to  this  analysis,  we  have  invented  an  architecture  based  on  two- 
dimensional  parallel  source  arrays,  spatial  light  modulators,  and  volume  holographic  optical 
elements  that  is  capable  of  circumventing  the  crosstalk  and  throughput  problems,  as  well  as 
providing for simultaneous rather than sequential weight updates [Appendix 8; Publs. 9
and 10; Book Ch. 3; Patent 1; Conf. Pres. 37, 38, 39, 41, 42, 45, 49, 50, 52, 53, 54, 57].
In  this  approach,  a  two-dimensional  source  array  (e.g.  the  surface  emitting  semiconductor 
diode  laser  arrays  recently  announced  by  Bellcore  and  ATT)  is  employed,  in  which  each 
source  is  individually  coherent,  but  all  sources  are  mutually  incoherent  over  the  response 
time  of  the  holographic  medium.  Two  optically  addressed  spatial  light  modulators  are 
employed  to  represent  the  input  and  output  planes.  Parallel  illumination  of  the  spatial  light 
modulators  by  the  source  array  produces  (with  appropriate  optics)  sets  of  mutually 
incoherent  holograms  in  the  volume  holographic  recording  medium,  all  of  which  are 
doubly  angularly  multiplexed  to  avoid  the  effects  of  beam  degeneracy. 

Simulations  of  this  architecture  using  the  optical  beam  propagation  method  have 
shown  a  remarkable  reduction  in  interchannel  crosstalk,  accompanied  by  a  corresponding 
substantial  increase  in  optical  throughput  efficiency.  Laboratory  demonstrations  of  8  x  8 
and 2 x 4096 interconnections in real time photorefractive crystals have confirmed the
theoretical and numerical predictions. Extension of both the numerical simulations and the
laboratory demonstrations to larger array sizes is under way at the present time.

The  Stratified  Volume  Holographic  Optical  Element  or  SVHOE  is  a  unique  optical 
element  capable  of  emulating  volume  holographic  diffraction  characteristics  in  devices 
consisting  of  only  thin  layered  photosensitive  materials,  and  at  the  same  time  exhibits  a 
large number of novel properties that are useful in array interconnection, wavelength
multiplexing and demultiplexing, and spatial frequency filtering applications [Appendices 3
and 5; Publs. 3, 5, 6, 7; Conf. Pres. 1, 2, 4, 6, 9, 11, 12, 14, 15, 16, 17, 21, 29, 32, 33,
49, 50, 52].

An  SVHOE  consists  of  a  number  of  thin  photosensitive  layers  separated  by  substrate 
(or  buffer)  layers  that  are  optically  insensitive.  This  type  of  construction  is  particularly 
advantageous  for  materials  that  exhibit  large  photoinduced  refractive  index  variations,  but 
that  are  difficult  to  fabricate  in  the  thicknesses  required  for  highly  multiplexed  volume 
holographic  device  operation  (of  order  1  mm  to  1  cm).  Examples  of  such  materials  include 
the  commercially  available  DuPont  holographic  photopolymer  material,  nonlinear  organic 
materials,  and  III-V  compound  semiconductor  multiple  quantum  well  structures.  In 
operation,  the  device  is  exposed  to  two  coherent  recording  beams  that  interfere  in  the 
photosensitive  layers  to  record  the  hologram,  and  readout  is  performed  in  direct  analogy 
with  other  volume  holographic  optical  elements. 

A  number  of  novel  diffraction  characteristics  of  SVHOE  structures  have  been 
identified,  numerically  modeled,  and  in  most  cases  experimentally  demonstrated  during  the 
contract  period.  These  include  a  unique  periodic  angular  tuning  characteristic,  the 
emulation  of  Bragg-limited  angular  response  characteristics,  the  generation  of  regularly 
spaced  arrays  of  output  angles  (or  positions)  when  illuminated  by  a  strongly  focused  beam, 
spatial  frequency  notch  filtering,  wavelength  notch  filtering,  and  wavelength  multiplexing 
and demultiplexing. In addition, SVHOE structures fabricated with active photorefractive
materials, such as III-V compound semiconductor multiple quantum well structures that are
voltage  or  field  enabled,  are  capable  of  exhibiting  tunable  diffraction  characteristics  that  can 
be  externally  modulated  by  altering  the  distribution  of  applied  voltages  on  a  layer  by  layer 
basis.  Preliminary  studies  of  such  active  SVHOE  structures  in  quantum  well  devices  have 
been  completed,  and  fabrication  and  evaluation  of  test  structures  is  under  way.  Finally, 
demonstration  of  key  SVHOE  characteristics  with  the  DuPont  holographic  photopolymer 
material  has  been  achieved  [Publ.  7]  in  order  to  take  advantage  of  the  in  situ  fixing 
capabilities of this class of organic materials.

In  addition  to  the  investigation  of  the  fundamental  limitations  of  photorefractive 
materials  (as  used  in  interconnection  applications)  described  above  [Appendices  4  and  7; 
Publ.  4;  Book  Chs.  2,  3;  Conf.  Pres.  3,  5,  7,  13,  15,  33],  we  have  also  undertaken  a 
multifaceted  effort  to  improve  the  prospects  for  practical  implementation  of  photorefractive 
devices.  This  effort  has  included  the  development  of  multilayer  antireflection  coatings  on 
Bi12SiO20, LiNbO3, BaTiO3, SBN, GaAs, and CdTe that eliminate the source of multiple
internal reflections within the photorefractive material [Publs. 13, 14, 15; Conf. Pres. 25,
33, 34, 35, 49, 50, 52], resulting in 100% increases in the saturation diffraction efficiency
and  two  beam  coupling  gain;  the  identification  and  characterization  of  the  electric  field 
collapse within photorefractive crystals that occurs during grating recording, by means of a
novel  transverse  electrooptic  imaging  method  [Publ.  17;  Conf.  Pres.  27,  32,  33];  the 
development  of  a  novel  electrooptic  technique  for  the  measurement  of  very  high  dark 
resistivities in electrooptic crystals (which determine the grating storage time) [Appendix 1;
Publ.  1;  Conf.  Pres.  8, 18,  22];  the  application  of  the  electrooptic  measurement  technique 
to  the  determination  of  the  dark  resistivities  of  undoped,  doped,  and  nonstoichiometric 
single  crystals  of  bismuth  silicon  oxide  [Appendix  1;  Publ.  1;  Conf.  Pres.  8,  18,  22];  the 
analysis  of  the  polarization  properties  of  diffraction  in  optically  active  and  electrooptic 
photorefractive  materials,  including  the  effects  of  self-diffraction  during  grating  recording 
[Appendix 2; Publ. 2; Conf. Pres. 26]; and the observation and characterization of the
effects of microscopic charge screening on sub-hologram formation in photorefractive
materials  [Publ.  8;  Conf.  Pres.  30],  which  has  important  implications  for  both  the 
implementation  of  spatially  segmented  volume  holograms  in  real  time  materials,  as  well  as 
for  the  analysis  of  the  effects  of  macroscopic  space-variant  illumination  effects  in  the 
recording of full aperture volume holograms.

Single crystals of bismuth silicon oxide (Bi12SiO20) and its isomorphs (including, for example, bismuth germanium oxide
(Bi12GeO20)) have been utilized in a wide range of active electrooptic and acoustooptic devices, including the Pockels Readout Optical
Modulator (PROM), the PRIZ, the Photorefractive Incoherent-to-Coherent Optical Converter (PICOC), volume holographic storage
devices, and surface acoustic wave devices. A key material parameter that influences device performance characteristics is the volume
resistivity, which is difficult to measure accurately using standard techniques in refractory oxides like Bi12SiO20 due to its large
magnitude (typically > 10'^ Ω cm). We present here a technique for the measurement of such very high resistivities in electrooptic
materials; this method utilizes the electrooptic modulation induced by a voltage placed across the (crystallographically oriented)
sample as a probe of temporal voltage transients that are in turn directly related to the sample volume resistivity. In our experiments,
a very weak optical probe is frequency modulated, phase detected, and employed at low duty cycle to avoid ambiguities due to
photoconductive voltage decay. The technique is described in detail, and experimental results are presented on a number of undoped
and doped samples of bismuth silicon oxide grown by the Czochralski technique.


1.  Introduction 

Bismuth silicon oxide is a wide bandgap, high resistivity
semi-insulator that is also photoconductive, electrooptic,
acoustooptic, magnetooptic, and optically active. As such, it
has found widespread application in optical information
processing and computing components such as spatial light
modulators and volume holographic optical elements [1].
Examples of such devices include the Pockels Readout Optical
Modulator (PROM) [2,3], the Photorefractive
Incoherent-to-Coherent Optical Converter (PICOC) [4], the
Optically Modulated Total Internal Reflection Spatial Light
Modulator (OM-TIR SLM) [5], and photorefractive volume
holographic optical elements (VHOEs) [1,6].

Several important operational parameters of both electrooptic
spatial light modulators and volume holographic optical
elements depend directly on the volume resistivity ($\rho$) of
the active electrooptic material. For example, the dielectric
relaxation time ($\tau = \rho\epsilon\epsilon_0$, in which
$\epsilon$ is the relative dielectric permittivity of the
material and $\epsilon_0$ is the permittivity of free space)
establishes the frame storage time in the PROM, PICOC, and
OM-TIR spatial light modulators, as well as the maximum frame
integration time. In addition, the dielectric relaxation time
determines the holographic grating storage time in Bi12SiO20
volume holographic optical elements. Furthermore, spatial
nonuniformities in the resistivity will produce undesirable
space-variant erasure and image decay characteristics.

Determination of the volume resistivity also has significant
implications for materials characterization. Just as in the
case of semiconductor materials, the as-grown resistivity
provides an indication of both overall crystal purity and
crystallographic perfection, as well as potential for
correlation with impurity and dopant analysis techniques. This
latter point is especially significant, since such techniques
are at present not well developed for low impurity/dopant
levels in dielectric matrices. In fact, resistivity variations
can yield indirect information regarding the nature of states
(shallow level, deep level, recombination center) induced by
specific selected incorporants. Finally, the dark resistivity
provides a critical baseline against which the photoconductive
sensitivity can be evaluated.

Traditional techniques for measuring resistivities of order
10'^ Ω cm and greater, such as direct measurement of the
current-voltage relationship, use of the four-point probe [7],
and Van der Pauw's method [8], are impractical due to the
resultant high voltages and extremely low currents implied by
the total sample resistance. Several other difficulties bear
on the measurement of very high resistivities, including the
surface conductivity of the specimen crystal and crystal
mount, the possible influence of humidity-enhanced surface
conduction, and the dependence of the empirically-derived dark
resistivity on the optical and thermal exposure history of the
sample.

We have utilized an electrooptic measurement technique as
described herein that largely circumvents these difficulties,
in order to evaluate the dark volume resistivities of both
undoped and doped samples of single crystal bismuth silicon
oxide. The technique is applicable to a wide range of
electrooptic refractory oxides, such as bismuth germanium
oxide (Bi12GeO20), bismuth titanium oxide (Bi12TiO20), barium
titanate (BaTiO3), lithium niobate (LiNbO3), and strontium
barium niobate (SrxBa1-xNb2O6).

The theoretical basis of the experimental technique is
presented in section 2, and the details of the method are
described in section 3. Results of resistivity measurements on
representative undoped and doped single crystals of Bi12SiO20
are given in section 4. Discussion and conclusions are
provided in section 5.


2. Theoretical considerations

The essence of the measurement technique is the formation of a
parallel plate capacitor with the specimen electrooptic single
crystal acting as the dielectric, by depositing
semi-transparent counterelectrodes on two optically polished,
parallel faces of the sample. The electrooptic effect is then
utilized to provide a direct measurement of the instantaneous
voltage applied across the sample. A high voltage is first
applied to the sample in order to charge the capacitor to an
initial voltage $V_0$, and is then disconnected to allow the
resultant transient voltage decay of the sample capacitor in
parallel with its internal resistance to be monitored. The
time constant of the voltage decay provides a direct
measurement of the resistivity of the sample independent of
geometric factors, as outlined below.

Consider a single crystal sample of Bi12SiO20 oriented along
the [001] direction as shown schematically in fig. 1. Bismuth
silicon oxide is a cubic (I23) non-centrosymmetric crystal,
characterized by a single value of the electrooptic
coefficient $r_{41} = r_{52} = r_{63}$. Hence, a voltage
$V = E_{[001]} d$ applied across the transparent conductive
electrodes (in which $d$ is the sample thickness) will induce
principal axes along the [110] and [$\bar{1}$10] directions,
with indices of refraction given by [9]:

    n_{[110]} = n_0 - \frac{1}{2} n_0^3 r_{41} E_{[001]}            (1)

and

    n_{[\bar{1}10]} = n_0 + \frac{1}{2} n_0^3 r_{41} E_{[001]}.     (2)

Fig. 1. Typical electrooptic configuration for measurement of
the volume resistivity of bismuth silicon oxide (Bi12SiO20).
Both crystallographic and electric-field-induced principal axes
are shown for a sample oriented along the [001] direction.

In eqs. (1) and (2) above, the natural optical activity of
bismuth silicon oxide is neglected (as will be discussed
further below). Consider further propagation of an optical
probe beam in the [001] direction, polarized along the [010]
direction. The two resultant principal components will
experience a net phase shift given by:

    \Gamma = \frac{2\pi}{\lambda} \Delta n \, d = \frac{2\pi}{\lambda} n_0^3 r_{41} V.     (3)

Note that in this longitudinal electrooptic configuration, the
phase difference depends only on the applied voltage, and not
on the internal electric field. For an analyzer oriented along
the [100] direction, the resultant transmitted intensity will
be:

    I = I_0 \sin^2(\Gamma/2) = I_0 \sin^2\left( \frac{\pi}{\lambda} n_0^3 r_{41} V \right).     (4)

This expression illustrates the one-to-one correspondence
between the transmitted intensity through the system and the
instantaneous voltage across the crystal (provided that the
applied voltage is always less than the half wave voltage
$V_\pi = \lambda / 2 n_0^3 r_{41}$).
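As a rough numerical illustration of the intensity-voltage relation and the half wave voltage just described, the following Python sketch evaluates $V_\pi = \lambda / 2 n_0^3 r_{41}$ and inverts the $\sin^2$ transmission law to recover the voltage. The material constants and probe wavelength used here are illustrative assumptions rather than values taken from this paper, and the function names are hypothetical.

```python
import math

# Illustrative assumptions only (not values stated in this paper):
WAVELENGTH_M = 632.8e-9    # assumed He-Ne probe wavelength, in metres
N0 = 2.54                  # assumed refractive index of the crystal
R41_M_PER_V = 5.0e-12      # assumed electrooptic coefficient, in m/V


def half_wave_voltage(wavelength=WAVELENGTH_M, n0=N0, r41=R41_M_PER_V):
    """Half wave voltage V_pi = lambda / (2 n0^3 r41)."""
    return wavelength / (2.0 * n0 ** 3 * r41)


def transmitted_fraction(voltage, v_pi):
    """I/I0 = sin^2(Gamma/2), with Gamma/2 = pi V / (2 V_pi)."""
    return math.sin(math.pi * voltage / (2.0 * v_pi)) ** 2


def voltage_from_fraction(fraction, v_pi):
    """Invert the sin^2 law; single valued only for 0 <= V <= V_pi."""
    return (2.0 * v_pi / math.pi) * math.asin(math.sqrt(fraction))


if __name__ == "__main__":
    v_pi = half_wave_voltage()
    frac = transmitted_fraction(1600.0, v_pi)
    print(v_pi, frac, voltage_from_fraction(frac, v_pi))
```

The inversion is single valued only while the voltage stays below the half wave voltage, which is the same restriction stated in the text.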

The transient voltage decay of a resistor in parallel with a
capacitor is given by:

    V(t) = V_0 \exp(-t/\tau),     (5)

in which $\tau$ is the characteristic RC time constant. For a
parallel plate capacitor of sufficiently high aspect ratio to
avoid significant field fringing effects, the RC product is
approximately equal to the dielectric relaxation time
$\rho\epsilon\epsilon_0$, which is thus independent of
geometric parameters. Since the relative dielectric
permittivity of Bi12SiO20 is known ($\epsilon = 56$ [10]),
measurement of the voltage decay time constant directly yields
the desired volume resistivity.
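The conversion from a measured time constant to a resistivity, and the cancellation of the geometric factors in the RC product, can be checked with a short Python sketch. The relative permittivity of 56 is the value quoted in the text; the electrode area and thickness in the usage lines are arbitrary illustrative numbers, and the function names are hypothetical.

```python
import math

EPSILON_0_F_PER_CM = 8.854e-14   # permittivity of free space, in F/cm
EPSILON_R = 56.0                 # relative permittivity quoted in the text


def resistivity_from_time_constant(tau_s, eps_r=EPSILON_R):
    """Volume resistivity in ohm cm from tau = rho * eps_r * eps_0."""
    return tau_s / (eps_r * EPSILON_0_F_PER_CM)


def rc_product(rho_ohm_cm, area_cm2, thickness_cm, eps_r=EPSILON_R):
    """RC of an ideal parallel plate sample (fringing neglected)."""
    resistance = rho_ohm_cm * thickness_cm / area_cm2            # R = rho d / A
    capacitance = eps_r * EPSILON_0_F_PER_CM * area_cm2 / thickness_cm
    return resistance * capacitance


if __name__ == "__main__":
    rho = resistivity_from_time_constant(2400.0)   # a 40 min time constant
    print(rho)
    # Two arbitrary geometries give the same RC product.
    print(rc_product(rho, 1.0, 0.1), rc_product(rho, 0.25, 0.3))
```

A time constant of 40 min (2400 s) corresponds to a resistivity of roughly 4.8 x 10^14 Ω cm under these assumptions, and the RC product comes out the same for any electrode area and thickness.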

Two issues are particularly worthy of note at this point in
the discussion. First and foremost, the measurement technique
utilized herein (as do all resistivity measurement techniques)
assumes a linear, or at least piecewise linear,
current-voltage relationship. As will be shown in section 4,
bismuth silicon oxide samples exhibit significant nonlinear
current-voltage behavior in the high field regime
(>10' V/cm). Hence the measurement described herein is more
accurately a measure of the differential volume (dark)
resistivity at the specified initial value of applied voltage
$V_0$. We have chosen a value of $V_0$ (1600 V) which is of
particular relevance to the device applications discussed in
section 1. The implications of this point will be discussed
further in section 4.
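When the decay is not purely exponential, one possible way to assign a differential time constant to each portion of a decay curve is through the local logarithmic slope, $\tau = -\,dt / d(\ln V)$. The Python sketch below illustrates this under stated assumptions: the paper does not say how its differential resistivities were computed, the function names are hypothetical, and the data in the usage lines are synthetic.

```python
import math

EPSILON_0_F_PER_CM = 8.854e-14   # permittivity of free space, in F/cm


def local_time_constants(times_s, voltages):
    """Local decay time constants from successive (time, voltage) pairs.

    Returns (midpoint_times, taus), with tau = -dt / d(ln V) on each
    interval. Assumes positive, strictly decreasing voltages.
    """
    mids, taus = [], []
    for i in range(len(times_s) - 1):
        dt = times_s[i + 1] - times_s[i]
        dlnv = math.log(voltages[i + 1]) - math.log(voltages[i])
        mids.append(0.5 * (times_s[i] + times_s[i + 1]))
        taus.append(-dt / dlnv)
    return mids, taus


def differential_resistivity(tau_s, eps_r=56.0):
    """Differential resistivity in ohm cm from a local time constant."""
    return tau_s / (eps_r * EPSILON_0_F_PER_CM)


if __name__ == "__main__":
    # Synthetic, purely exponential decay from 1600 V with tau = 2400 s.
    times = [0.0, 600.0, 1200.0, 1800.0]
    volts = [1600.0 * math.exp(-t / 2400.0) for t in times]
    mids, taus = local_time_constants(times, volts)
    print(mids, taus, [differential_resistivity(t) for t in taus])
```

For a purely exponential decay every interval returns the same time constant; a drift in the local values along a measured curve would indicate a nonlinear current-voltage characteristic.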

The second issue concerns the absolute orientation of the
single crystal sample, the effects of finite absorption
coefficients, and the natural optical activity of Bi12SiO20.
As described in section 3, an essentially arbitrarily oriented
sample with any combination of linear and nonlinear optical
properties can be easily accommodated by the simple device of
obtaining a calibration curve of transmitted intensity as a
function of voltage during the initial charging of the
capacitor. The only requirement is that a measurable fraction
of the transmitted light experience an electrooptic (or
electrorefractive) effect such that a monotonic calibration
curve can be established. This further eliminates any
experimental dependence on the measurement wavelength, the
electrooptic coefficient, the index of refraction, the crystal
thickness, the angle of incidence, and the optical rotatory
power.
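The use of a monotonic calibration curve to convert a measured intensity back into a voltage can be sketched as a simple table lookup with linear interpolation. This is only an illustration of the idea: the interpolation scheme, the function name, and the calibration values in the usage lines are assumptions and are not taken from the paper.

```python
from bisect import bisect_left


def voltage_from_intensity(intensity, cal_intensities, cal_voltages):
    """Look up the voltage for a measured intensity.

    cal_intensities must be strictly increasing and paired one-to-one
    with cal_voltages (the monotonic calibration curve).
    """
    lowest, highest = cal_intensities[0], cal_intensities[-1]
    if intensity > highest or lowest > intensity:
        raise ValueError("intensity outside the calibrated range")
    i = bisect_left(cal_intensities, intensity)
    if i == 0:
        return cal_voltages[0]
    x0, x1 = cal_intensities[i - 1], cal_intensities[i]
    y0, y1 = cal_voltages[i - 1], cal_voltages[i]
    # Linear interpolation between the two bracketing calibration points.
    return y0 + (y1 - y0) * (intensity - x0) / (x1 - x0)


if __name__ == "__main__":
    # Hypothetical calibration: normalized intensity versus applied voltage.
    cal_i = [0.0, 0.1, 0.4, 0.9]
    cal_v = [0.0, 400.0, 800.0, 1600.0]
    print(voltage_from_intensity(0.25, cal_i, cal_v))
```

Because only the measured curve is used, no knowledge of the wavelength, electrooptic coefficient, or sample thickness enters the conversion, which is the point made in the text.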

Finally, a similar resistivity measurement technique has been
described previously [11] in which the sample to be measured
is connected in parallel with an electrooptic crystal, which
in turn is utilized to measure the voltage decay. This
technique is applicable for the measurement of resistivities
small compared with that of the calibrated sample.


3.  Experimental  procedure 

The basic configuration utilized for the measurement of high
volume-resistivities in electrooptic crystals is as shown in
fig. 2. The sample to be measured is typically oriented as
shown in fig. 1 and mounted in an environmental chamber to
allow control of the ambient atmosphere. The chamber is
flushed with filtered ultra high purity dry N2 gas prior to
and during the measurement procedure in order to minimize
adsorption of surface moisture and contaminants. Incident
probe light derived from a polarized helium neon laser (chosen
to minimize the photoconductive decay of the bismuth silicon
oxide samples) is spatially filtered, collimated, apertured,
and passed through a final polarizer before transmittal
through the sample at near normal incidence and a polarization
analyzer. Light passing the analyzer is detected by a silicon
photodetector, which provides the signal channel input to a
phase sensitive detector (Princeton Applied Research Model 126
Lock-In Amplifier). The reference output of the lock-in is
used to drive an optical chopper through a controller to
complete a phase-locked loop. This allows operation at very
low signal amplitudes (incident intensities), again minimizing
photoconductive decay of the sample voltage. The average
incident power during the course of the measurement is further
reduced by utilization of neutral density filters and a
computer controlled optical shutter that remains closed at all
times except during brief measurement intervals.

Fig. 2. The experimental configuration for measurement of
volume resistivity in electrooptic materials, as described in
detail in section 3.

Voltage is applied to the crystal by means of a programmable
high voltage power supply through a single pole, single throw
knife switch. Both a mercury relay and a high voltage
contactor were employed in trial runs in the hopes of
implementing full computer control of the voltage disconnect
sequence, but proved to have intolerably high parasitic
capacitances. The knife switch was automated for computer
control by incorporation of a long throw solenoid interfaced
to the microcomputer.

In operation, the microcomputer increases the voltage across
the sample in increments, recording the transmitted intensity
at each incremental voltage. This produces the requisite
calibration curve for direct interpretation of the voltage
transient decay, as described in section 2. Examples of such
calibration curves are shown in fig. 3 for the four samples
employed in this study. Following completion of the
calibration sequence, the optical shutter is automatically
closed, and the knife switch automatically opened by means of
the electromechanical solenoid. The microcomputer controls
subsequent data acquisition during the decay transient,
controlling the shutter, recording the transmitted intensity,
and recording the instantaneous laser power by means of a
reference photodiode for purposes of normalization. Voltage
decay measurements were typically recorded every 15 min, with
five independent measurements of the intensity at a given time
averaged to form each plotted data point. Dependent on the
decay time constant to be measured, voltage decay runs varied
in duration from 15 min to over one hundred hours.
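The normalization and averaging step described above can be sketched in a few lines of Python. The order of operations (dividing each signal reading by its simultaneous reference reading and then averaging the ratios) is an assumption, since the text does not specify it; the function name and the readings in the usage lines are hypothetical.

```python
from statistics import mean


def normalized_data_point(signal_readings, reference_readings):
    """Average of signal/reference ratios for one plotted data point.

    Each signal reading is normalized by the laser power recorded at the
    same instant by the reference photodiode, and the ratios are averaged.
    """
    if len(signal_readings) != len(reference_readings):
        raise ValueError("signal and reference readings must be paired")
    ratios = [s / r for s, r in zip(signal_readings, reference_readings)]
    return mean(ratios)


if __name__ == "__main__":
    # Five hypothetical readings taken during one measurement interval.
    signal = [2.0, 2.2, 1.8, 2.0, 2.0]
    reference = [1.0, 1.1, 0.9, 1.0, 1.0]
    print(normalized_data_point(signal, reference))
```

Normalizing before averaging removes laser power fluctuations from each reading individually, so that slow drift in the source does not masquerade as voltage decay.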

A typical series of voltage decay curves is plotted in fig. 4
for one of the undoped Bi12SiO20 samples. Four curves are
shown, for average power levels during measurement of 2 µW,
200 nW, [?] nW, and 20 pW. The first three curves show the
effects of enhanced photoconductive decay, while the 20 pW
decay curve showed little sensitivity to continued decreases
in average power level, and hence can be considered
representative of the dark resistivity.

Fig. 3. Electrooptic calibration curves for the four samples
utilized in this study. The quantity T is the intensity
transmission coefficient of each sample at the probe
wavelength, uncorrected for reflection losses.

Fig. 4. Characteristic voltage decay curves for undoped sample
BSO-51, showing the dependence of the decay time constant on
the average incident power level over the time span of the
measurement.

It should be noted that bismuth silicon oxide exhibits the
effects of multiple trap levels [10], and as such the
occupancy state of these traps prior to measurement will
affect the obtained values of the dark resistivity. It has
further been observed, for example, that pre-illumination with
IR can significantly alter the trap occupancy and dark
conductivity [12]. In order to standardize the initial
conditions in a manner representative of the operating
conditions of most spatial light modulators and volume
holographic optical elements, the samples were pre-illuminated
with intense broad band radiation while the counterelectrodes
were shorted together. This serves the further purpose of
uniformly distributing any accumulated space charge prior to
measurement.

Careful sample preparation was found to be critical in
obtaining consistently repeatable results. Single crystal
samples were oriented by Laue back reflection X-ray
diffraction techniques, cut, and polished to better than λ/4.
The samples were then ultrasonically cleaned in semiconductor
grade TCE, acetone, and methanol, followed by a thorough rinse
in 18 MΩ cm deionized water. Indium tin oxide (ITO)
transparent counterelectrodes were deposited by RF magnetron
sputtering through pattern-defining masks. Contact to the
samples was made through opposing conductive o-ring gaskets,
and in some cases by small wires attached to the ITO
electrodes with silver paint. Between measurements, samples
were kept in a desiccated environment to avoid adsorbed
moisture.


4.  Resistivity  measurements 

Four single crystal samples of Bi12SiO20 were chosen for
measurement, representative of both undoped and doped crystals
grown from the melt by the Czochralski technique [9]. The four
crystal samples and their characteristics are listed in table 1.
The first two samples, BSO-28 and BSO-51, are representative of
undoped single crystals grown from different starting batches of
five 9's purity Bi2O3 and SiO2 under similar but not identical
growth conditions. Sample BSO-53 was grown from the same starting
melt as BSO-51, with the addition of 5 ppm (atomic) iron (from
iron oxide) to the melt. Sample BSO-46 was intentionally doped
with 180 ppm (atomic) chromium in the melt (from chromium oxide).
Using reported values of the segregation coefficients in Bi12SiO20
(Fe: <0.05; Cr: 1.8 [13]), this implies crystalline doping
densities of approximately 250 ppb or less Fe in BSO-53 and
320 ppm Cr in BSO-46. The optical quality of the samples was
determined by careful inspection for strain induced birefringence
using polarization microscopy. No growth induced strain was
evident in any of the samples.

Table 1
Summary of Bi12SiO20 sample characteristics

Sample    Description
BSO-28    Undoped               Nominally pure
BSO-51    Undoped               Nominally pure
BSO-53    Fe doped (5 ppm)      Very low level dopant
BSO-46    Cr doped (180 ppm)    Intermediate level dopant
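
As a quick check of the doping densities quoted above, the concentration in the crystal can be estimated as the melt concentration multiplied by the segregation coefficient. The Python sketch below assumes only this first-order relation (it ignores any drift in melt composition during growth); the function name is illustrative and is not taken from the original work.

```python
def crystal_doping_ppm(melt_ppm, segregation_coefficient):
    """Estimate the dopant concentration in the crystal (atomic ppm).

    Assumes the crystal concentration is simply the melt concentration
    times the segregation coefficient.
    """
    return melt_ppm * segregation_coefficient


# Fe: the segregation coefficient is only bounded (< 0.05), so this is an upper limit.
fe_ppb_upper = crystal_doping_ppm(5.0, 0.05) * 1000.0
# Cr: segregation coefficient of 1.8, 180 ppm in the melt.
cr_ppm = crystal_doping_ppm(180.0, 1.8)

print(f"Fe in crystal: at most about {fe_ppb_upper:.0f} ppb")
print(f"Cr in crystal: about {cr_ppm:.0f} ppm")
```

The Cr estimate comes out slightly above the rounded figure of 320 ppm quoted in the text.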

Characteristic decay curves for each of the four samples are
shown in fig. 5. Sample BSO-28 exhibited the shortest time
constant to the 1/e point (approximately 40 min), and Cr-doped
BSO-46 exhibited the longest (in excess of 100 h). As discussed
in section 2, note that none of the voltage decay curves shown
in fig. 5 can be adequately fit by a straight line as would be
expected for a purely exponential decay. This is a clear
indication of an inherent nonlinearity in the current-voltage
characteristic for this voltage range. However, the calculated
variation in differential resistivity along each of the curves
amounts to only about a factor of two.
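
To connect the decay times quoted above with a resistivity scale, a rough estimate can be made by treating the crystal as a leaky capacitor, for which the dielectric relaxation time is τ = ρ ε ε0. The Python sketch below makes that assumption (a single exponential, which the text notes is only approximately valid here) and takes a relative dielectric constant of 56 as an assumed value for Bi12SiO20. The function names are illustrative only.

```python
import math

EPS0_F_PER_CM = 8.854e-14  # vacuum permittivity in F/cm


def resistivity_from_time_constant(tau_s, eps_r=56.0):
    """Resistivity in ohm cm from tau = rho * eps_r * eps0 (leaky-capacitor model)."""
    return tau_s / (eps_r * EPS0_F_PER_CM)


def local_time_constant(t1_s, v1, t2_s, v2):
    """Effective 1/e time constant from two points (t1, v1), (t2, v2) on a decay curve."""
    return (t2_s - t1_s) / math.log(v1 / v2)


# A 40 min time constant, as quoted for the fastest-decaying sample.
rho_40min = resistivity_from_time_constant(40 * 60.0)
print(f"rho for tau = 40 min: {rho_40min:.2e} ohm cm")
```

Applying `local_time_constant` to successive pairs of data points gives a differential estimate that varies along a non-exponential curve, which is the sense in which a differential resistivity is used in the text.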

Resistivity values for each of the four representative samples
are plotted for comparison in fig. 6 as a function of the average
incident power level over the time span of the measurement.
These values were calculated from the first two reliable data
points following initiation of the voltage decay measurement in
each case, and as such closely approximate the differential
resistivity at or near 1600 V for each sample. Note that each
curve is seen to saturate with decreasing average incident power,
indicating a valid assignment of the dark resistivity at the
lower limiting power level. Furthermore, note the variation in
slopes for higher incident power levels, which implies a
corresponding variation in sample photoconductivity at the probe
wavelength of 6328 Å.

Fig. 5. Characteristic voltage decay curves for all four Bi12SiO20
samples, all at an average incident power level of 20 pW over
the time span of the measurement.

The lowest resistivity sample measured is BSO-28, with a value
near 4 × 10^14 Ω cm. The second undoped sample (BSO-51) is
considerably more resistive, at 1.5 × 10^15 Ω cm. This variation
can be attributed either to differences in starting batch
impurity levels or to distinct growth conditions. The addition
of only 5 ppm Fe to the melt is seen to lower the resistivity
from that of BSO-51 to that of BSO-53 (9 × 10^14 Ω cm), while
the addition of 180 ppm Cr to the melt produced a sample with
the highest resistivity observed among our samples to date
(BSO-46: 1.9 × 10^[illegible] Ω cm).

Fig. 6. Measured values of the volume resistivity for all four
Bi12SiO20 samples as a function of average incident power
over the time span of the measurement.


It should be noted that the Cr doped sample was unusual in that
it proved to be quite insensitive to considerable levels of
illumination. For example, with 1600 V applied, minutes of
exposure to bright, broad band illumination caused negligible
voltage decay, as did continual exposure to nominal ambient room
illumination levels. The combination of an increased resistivity
with a significantly diminished photosensitivity suggests that
Cr may form an active recombination center with a large capture
cross section in bismuth silicon oxide. Other dopants in
Bi12SiO20 have also been observed to both decrease the
photoconductivity in the visible spectrum and change the
volume resistivity [14].

5.  Discussion  and  conclusions 

Several device implications accrue to the electrooptic
measurement of the volume resistivity of bismuth silicon oxide,
particularly with regard to its potential correlation with growth
and doping conditions. The attainment of very large resistivity
values in certain electrooptic single crystals through either
growth modification or dopant incorporation is of considerable
importance, as such values imply very large charge storage
(frame) times in most device configurations. In addition, the
achievement of long dielectric relaxation times implies the
added flexibility of decay-free temporal integration processing
modes in both optically addressed electrooptic spatial light
modulators and photorefractive volume holographic optical
elements. In some cases, device structures utilize dielectric
blocking layers to prevent longitudinal charge transport through
the device. An important example is the Pockels Readout Optical
Modulator, which employs vapor deposited parylene organic thin
films as high resistivity, high dielectric breakdown strength
blocking layers. In such cases, pre-determination of the volume
resistivity allows the effects of sub-optimum dielectric
blocking layers to be accurately assessed [15-17].

Effects of growth modification and/or dopant incorporation on
the photoconductive response at particular wavelengths can also
have considerable impact on potential enhancements of device
performance. For example, increased photoconductivity at
specified (writing) wavelengths optimizes the exposure
sensitivity of optically addressed spatial light modulators. On
the other hand, decreased photoconductivity at specified
(readout) wavelengths reduces the potential for optical damage
in single and multiple channel electrooptic modulators, and can
also increase the available readout gain for several classes of
spatial light modulators.

We have presented herein a technique for accurate and repeatable
determination of the volume resistivity (and photoconductivity)
of electrooptic single crystals. The technique is quite
straightforward to implement, is nondestructive, and allows for
in situ variation of parameters such as external illumination,
ambient atmosphere, and ambient temperature. In addition, the
technique does not require accurate crystallographic orientation
or sample alignment for successful implementation.

Abstract: Doppler-enhanced self-diffraction in two-beam coupling
experiments with sillenite crystals exhibits pronounced optical
polarization effects due to the concomitant presence of natural
optical activity and electric-field-induced linear birefringence.
Coupled wave equations that describe the polarization properties,
exclusive of self-diffraction effects, have previously been
derived for the two principal crystal orientations most commonly
used in photorefractive recording. In this paper, the coupled
wave equations are combined with a linearized model of the
photorefractive recording process (single trap level, single
mobile charge species) to analyze the impact of self-diffraction
effects on the polarization state evolution. Numerical solutions
of these equations yield optimum configurations for enhanced gain
and improved image contrast in two-wave mixing with such
materials. In addition, inclusion of optical activity in the
model emphasizes the contribution of this effect to the apparent
reduction of the effective electrooptic coefficient of bismuth
silicon oxide crystals.

I.  Introduction 

A most interesting class of photorefractive media is that of
the sillenite crystals, which includes bismuth silicon oxide
(Bi12SiO20, or BSO), bismuth germanium oxide (Bi12GeO20, or BGO),
and bismuth titanium oxide (Bi12TiO20, or BTO) [1]. These crystals
exhibit high photorefractive sensitivity for volume holographic
grating formation [2], fast response, long storage times under
dark conditions, and essentially unlimited recyclability. Such
crystals are potentially useful for dynamic real-time and time
average interferometry [3], nonlinear optical signal processing
[4], [5], phase-corrected image propagation through aberrating
media [6], optical interconnections [7], spatial optical
switching [8], self-pumped laser resonators [9], and image
amplification [10].

Light diffraction from volume holograms in sillenite crystals
exhibits pronounced polarization properties, encompassing the
effects of both natural optical activity and
electric-field-induced linear birefringence. A detailed model of
these polarization properties is critical for obtaining maximum
performance in photorefractive applications. For example,
Herriau et al. [11], [12] have demonstrated a technique for
significantly improving the holographic image quality by
suppressing spurious scattered light. This technique relies on
the fact that each of the two holographic writing beams generally
evolves into a distinctly different polarization state, so that
the two beams can be distinguished by a polarization analyzer.

Numerous studies of the polarization properties have been
published for each of the two principal crystallographic
orientations (Figs. 1 and 2) in which the presence of optical
activity is neglected [13]-[15]. The effects of optical activity
have been incorporated by coupled wave equation techniques, as
derived by the authors [16], and independently by Vachss and
Hesselink [17], which describe transmissive holographic recording
geometries in the absence of self-diffraction effects. Mallick
et al. have derived analytic solutions for these coupled wave
equations in the limit in which one of the two recording beams
remains undepleted as it propagates, and in the limit of perfect
Bragg matching [18]. Kukhtarev et al. have derived a set of
coupled wave equations which includes both optical activity and
self-diffraction effects, and have observed that self-diffraction
can significantly modify the polarization states exhibited by the
light beams [19]. The detailed analysis presented in [19]
emphasized a reflection holographic recording geometry, but the
coupled wave equations can easily be recast for a transmissive
geometry. The scope of the present paper is to explore the impact
of self-diffraction effects on the polarization properties of
volume holograms in sillenite crystals recorded in a transmissive
geometry (as shown schematically in Fig. 3). Operation deep
within the Bragg regime is assumed throughout the following
analysis.

Self-diffraction refers to the process whereby the two writing
laser beams, which interfere to form the photorefractive grating,
diffract from the forming grating, thereby modifying the
interference fringe profile deeper within the crystal [20]-[22].
This effect modifies both the modulation depth of the grating and
the phase of the fringe system. The result is a temporal and
spatial evolution of the grating strength and phase, which
eventually stabilizes in the steady-state limit to a
spatially-varying grating strength and phase. Representative
steady-state grating profiles induced by self-diffraction are
shown in Figs. 4 and 5, based upon the model described in the
next section.

Fig. 1. The {K_G ∥ <001>} crystal orientation of bismuth silicon
oxide (Bi12SiO20, or BSO) for volume holography. The dashed line
represents a normal to the entrance face. The x, y, and z
coordinate system shown to the left of the crystal is assumed in
the coupled wave analysis. The X_EO, Y_EO, and Z_EO axes shown on
the crystal face refer to the principal electrooptic axes induced
by the applied bias electric field.

Fig. 2. The {K_G ⊥ <001>} crystal orientation of Bi12SiO20 for
volume holography.

Fig. 3. Optical arrangement for Doppler-enhanced two-beam
coupling, in which the temporal frequency of one of the two
writing laser beams is slightly shifted by reflection from a
piezoelectrically driven mirror, creating a moving grating within
the photorefractive medium.

Note the curious structural features of the grating profiles,
especially for the {K_G ⊥ <001>} orientation (defined in
Fig. 2). In one case, the grating strength stays essentially
uniform for the conditions considered, but the grating
phase fronts are corrugated rather than flat. In another
case, the grating phase fronts remain flat but the grating
strength exhibits a layered structure.

One manifestation of self-diffraction effects can be energy
coupling between the two recording laser beams, in which the
intensity of one of the beams can be amplified at the expense of
the intensity in the second beam. This phenomenon has been
extensively investigated in the literature [10], [23]-[25].
Maximum energy coupling occurs when a 90° phase shift exists
between the optical interference pattern and the resulting space
charge field. This optimum phase shift occurs naturally when
recording stationary interference patterns in the diffusion
regime, but typical space charge fields and hence energy coupling
effects are low in this regime. The photosensitivity can be
significantly enhanced by applying a bias electric field (as
shown in Figs. 1-3), but the phase relationship between a
stationary light interference pattern and the resulting space
charge field is generally not optimum for energy coupling.

Fig. 4. Self-diffraction induced growth of the grating modulation
depth as a function of crystal depth for various configurations.
A bias field of 6 kV/cm and the material parameters for BSO
given in Table I are assumed.

Fig. 5. Variation of the space charge field phase with depth in
the crystal resulting from self-diffraction effects, for the same
recording conditions assumed in the previous figure. The phase
angle plotted here is relative to the phase angle at the entrance
face of the crystal.

At least two techniques have been demonstrated for enhancing the
self-diffraction process for energy coupling applications. One
technique is to Doppler shift one of the recording laser beams,
for example by reflecting this beam from a piezoelectrically
driven constant velocity mirror, as shown in Fig. 3 [10],
[23]-[25]. A Doppler shift causes the light interference pattern
to translate with speed v, which modifies the phase between the
optical interference and space charge field profiles. One
particular grating speed v_opt exists which ensures the 90° phase
shift needed for maximum energy coupling. An additional benefit
of the Doppler technique is a significant increase in the
magnitude of the space charge field, potentially by as much as an
order of magnitude (see Fig. 6, which is described more fully in
Section II-B). A second technique for enhancing energy coupling
is to record a stationary interference pattern with a rapidly
reversing applied bias field [26].
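
The link between the Doppler shift and the fringe velocity can be illustrated numerically. The Python sketch below assumes two standard relations that are not spelled out in the text: a frequency offset Δf between the two writing beams translates the interference pattern at v = Δf Λ, where Λ is the grating period, and a mirror moving at speed u along the beam direction (normal incidence) shifts the reflected frequency by Δf = 2u/λ. The function names and sample numbers are illustrative only and are not values from the experiments described here.

```python
def fringe_speed_um_per_s(delta_f_hz, spatial_freq_cycles_per_mm):
    """Fringe translation speed in micrometres per second, v = delta_f * period."""
    period_um = 1000.0 / spatial_freq_cycles_per_mm  # grating period in micrometres
    return delta_f_hz * period_um


def mirror_doppler_shift_hz(mirror_speed_m_per_s, wavelength_m):
    """Frequency shift on reflection from a mirror moving along the beam (normal incidence)."""
    return 2.0 * mirror_speed_m_per_s / wavelength_m


# Example: a 10 Hz offset with a 300 cycles/mm grating.
v_example = fringe_speed_um_per_s(10.0, 300.0)
print(f"fringe speed: {v_example:.1f} um/s")
```

Because the required offsets are typically small, a slowly ramped piezoelectric mirror is sufficient to set the fringe speed in such a scheme.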

Self-diffraction effects can also be manifested as an alteration
of the polarization states of the two writing beams, as shown in
Figs. 7-18. These figures compare representative evolutions of
polarization angle and ellipticity for the undepleted pump beam
and for the signal beams associated with three alternative
recording configurations. One recording configuration involves
no self-diffraction, but rather presumes a photorefractive
grating which is uniform in both amplitude and phase throughout
the volume of the hologram. This is the model assumed in our
previous polarization properties analysis [16], as well as those
of Vachss and Hesselink [17], and Mallick et al. [18]. The second
recording configuration incorporates self-diffraction effects
associated with a stationary interference pattern, and the third
configuration incorporates self-diffraction effects associated
with a moving interference pattern which can enhance energy
coupling in sillenite crystals [10], [23]-[25]. These numerical
solutions are reviewed more fully in Section III-A.

Figs. 7-18 demonstrate graphically that self-diffraction effects
have a profound impact on the polarization properties of light
diffraction from volume holograms in sillenite crystals,
especially when techniques such as Doppler-shifting are used to
enhance the self-diffraction. A model of light diffraction in
sillenite crystals which incorporates self-diffraction effects is
detailed in Section II, and the sample solutions are studied in
detail in Section III-A. To understand the key implications of
the numerical solutions, fundamental features of the diffraction
anisotropy are reviewed in Section III-B. Finally, conclusions
are drawn from the analysis in Section IV.

II. Description of the Model and Numerical Method

The complete model of light diffraction in the presence of
self-diffraction effects synthesizes two principal components.
One component specifies the light diffraction, coupling, and
polarization evolution in an electrooptic medium in response to
a prescribed quasi-static electric field pattern E(x, z). The
second component specifies the photorefractive recording of a
space charge field E_sc(x, z) in response to a given optical
intensity profile formed by the coherent interference of two
intersecting laser beams. Each component is explored separately
below, and the numerical algorithm for combining them is then
described.


A. Coupled Wave Equations for Light Diffraction

The diffraction of light, energy coupling, and evolution of the
polarization states for two intersecting laser beams in a volume
hologram are all governed by a set of coupled wave equations,
one set for each of the two principal crystallographic
orientations. These two sets of equations have been derived and
discussed in detail previously [16] and are therefore not
reproduced herein. The parameters appearing in these equations
include natural circular birefringence and
electric-field-induced linear birefringence. The electric field
profile E(x, z) in steady state is assumed to have the form

$$E(x, z) = E_0 + \tfrac{1}{2}\left[ E_{sc}(z)\,\exp(i K_G x) + \text{complex conjugate} \right] \qquad (1)$$

in which E_0 is a bias electric field applied to the crystal to
enhance the photosensitivity (shown in Figs. 1-3), and E_sc(z)
is the space charge field induced by the photorefractive effect.
Higher order spatial harmonics of the space charge field may
exist, but are neglected in this analysis because these
harmonics are not Bragg-matched to the incident light beams.

Given a particular form for the space charge field E_sc(z), the
coupled wave equations can be readily integrated to give the
light intensities and polarization states at the exit window of
the volume hologram. In our previous studies, the space charge
field E_sc was presumed to be uniform in amplitude and phase
throughout the hologram. However, self-diffraction effects are
known to induce spatial variations in the amplitude and/or
phase, as shown in Figs. 4 and 5, and so we must study the
effects of self-diffraction on the photorefractive recording
process, considered next.

B. Photorefractive Recording Model

In the photorefractive recording process, a space charge field
E_sc(z) is induced in response to an optical interference
pattern formed between two writing beams with amplitudes R and S
(see Fig. 3). A useful parameter for analyzing the
photorefractive recording process is the modulation depth m of
the optical interference pattern, which is related to the light
amplitudes R and S by

$$m = \frac{2\,R^{*} \cdot S}{|R|^{2} + |S|^{2}} \qquad (2)$$

in which the asterisk denotes complex conjugation, and the dot
denotes an inner product. The modulation depth m as defined
above is a complex number with phase determined by the relative
phase of the optical interference pattern with respect to the
coordinate system.
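
The definition of the modulation depth can be made concrete with a short calculation on Jones vectors. The Python sketch below assumes the standard form m = 2 R*·S / (|R|^2 + |S|^2), with R and S given as two-component complex polarization vectors; the function names are illustrative and not taken from the original paper.

```python
def inner(a, b):
    """Inner product a* . b of two Jones vectors (sequences of complex numbers)."""
    return sum(x.conjugate() * y for x, y in zip(a, b))


def modulation_depth(R, S):
    """Complex modulation depth m = 2 R* . S / (|R|^2 + |S|^2)."""
    return 2.0 * inner(R, S) / (inner(R, R).real + inner(S, S).real)


# Weak signal polarized parallel to a strong pump (amplitude ratio 0.01).
m_parallel = modulation_depth([1.0 + 0j, 0j], [0.01 + 0j, 0j])
# The same beams with orthogonal polarizations do not interfere.
m_orthogonal = modulation_depth([1.0 + 0j, 0j], [0j, 0.01 + 0j])
# A 90 degree relative optical phase rotates m in the complex plane.
m_quadrature = modulation_depth([1.0 + 0j, 0j], [0.01j, 0j])

print(abs(m_parallel), abs(m_orthogonal), m_quadrature)
```

The orthogonal case shows why a non-identical polarization evolution of the two beams changes the recorded grating strength even when their intensities are unchanged.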

A linearized recording model applies whenever the intensity of
one of the recording beams is significantly smaller than that of
the second beam, such that |m| << 1. The stronger beam is
generally called the pump beam, with amplitude R, and the weaker
beam is called the signal beam, with amplitude S. In the linear
recording and steady-state limits, the space charge field can be
written as

$$E_{sc} = m \times (\text{electric field factor}) \qquad (3)$$

at least to first order in m. The electric field factor remains
constant throughout the volume of the hologram, whereas the
modulation depth m varies slowly with depth coordinate z due to
energy coupling and also due to the nonidentical polarization
state evolution of the two writing laser beams.

Detailed expressions for the field term in the linear
approximation have been published by Kukhtarev et al. for
recording stationary gratings using a single mobile charge
species and single trap species model [22], and alternatively
using two types of photoexcited carriers and two trap species in
the charge transport model [19]. In the context of the present
analysis, the only effect induced by the additional charge and
trap species is to reduce the coupling between the two light
beams, and perhaps to modify its phase. For simplicity, we
restrict our attention to a single mobile species/single trap
species model for the remainder of this article.

Analytic expressions for the field factor for Doppler-enhanced
recording, assuming a single mobile charge species and a single
trap site, have been derived by Valley [25] and Refregier et al.
[10]. Fig. 6 shows typical magnitudes of the field factor for
Doppler-enhanced recording at optimum speed to maximize the
energy coupling, based upon the published analytic solutions,
and assuming the material parameters listed in Table I. The
grating speed has been reoptimized for each combination of
applied bias field and grating spatial frequency.

C.  Numerical  Algorithm 

A typical numerical analysis of self-diffraction effects
proceeds as follows. First, one specifies two incident light
beams, including their intensities and polarization states, at
the entrance window z = 0 of the photorefractive crystal. From
this, one can determine the modulation depth m(z = 0) of the
intensity profile, and hence the space charge field E_sc(z = 0)
and the birefringent coupling factor (discussed previously in
[16]). Next, the coupled wave equations can be integrated
directly to determine the light amplitudes and polarization
states at a deeper level dz within the crystal. This defines the
space charge field recorded at that depth through the modulation
index m. The process is then repeated as needed until the exit
window of the photorefractive crystal is reached. The numerical
integration can be performed either by a Runge-Kutta algorithm
[27], [28] or by the anisotropic optical beam propagation method
[29]. It is interesting to note that the computation time with
self-diffraction effects included is only minimally longer than
the time for the original set of coupled wave equations, at
least for a linear model of the photorefractive recording
process in steady state. Also, it must be emphasized that the
numerical solutions presented in this article apply only to
linear recording (|m| << 1) in the steady-state regime. This
concludes the description of the model and the corresponding
numerical methods.

Fig. 6. Magnitude of the space charge field factor as a function
of the grating spatial frequency for varying values of the
applied bias field, as achieved by the Doppler-enhanced recording
technique. The grating velocity has been reoptimized for each
point on the curves. (See also Valley [23] for similar curves.)

Table I
Parameters Assumed in the Calculations*
(Bi12SiO20 at λ = [illegible])

Parameter                           Value                    Dimension
refractive index (n_0)              [illegible]              dimensionless
absorption (α)                      1                        cm^-1
optical rotatory power (ρ)          38.6                     degrees/mm
electrooptic coefficient (r_41)     [illegible]              pm/V
intensity ratio                     1.0 × 10^-[illegible]    dimensionless
trap density (N)                    1.0 × 10^[illegible]     cm^-3
dielectric constant (ε)             56                       dimensionless
electron mobility (μ)               0.03                     cm^2/(V s)
electron recombination rate (γ_R)   2.0 × 10^-[illegible]    cm^3/s

*[32], [33].
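
The marching procedure described in Section II-C can be sketched in a few lines of Python. The example below is a scalar toy model only: it ignores polarization, optical activity, and absorption, and it replaces the vector coupled wave equations of [16] with a hypothetical scalar pair, dR/dz = -i c g* S and dS/dz = -i c g R, in which the local grating amplitude g is the modulation depth times a complex field factor. The coupling constant `coupling`, the `field_factor` value, and all function names are assumptions made for illustration.

```python
import math


def modulation_depth(R, S):
    """Scalar modulation depth m = 2 R* S / (|R|^2 + |S|^2)."""
    return 2.0 * R.conjugate() * S / (abs(R) ** 2 + abs(S) ** 2)


def derivatives(R, S, coupling, field_factor):
    """Right-hand side of the toy scalar coupled wave equations.

    The grating is re-derived from the local beam amplitudes (linear
    recording), which is how self-diffraction enters the integration.
    """
    g = modulation_depth(R, S) * field_factor
    dR = -1j * coupling * g.conjugate() * S
    dS = -1j * coupling * g * R
    return dR, dS


def march(R0, S0, coupling, field_factor, thickness, n_steps=2000):
    """Fourth-order Runge-Kutta march from z = 0 to z = thickness."""
    dz = thickness / n_steps
    R, S = complex(R0), complex(S0)
    for _ in range(n_steps):
        k1 = derivatives(R, S, coupling, field_factor)
        k2 = derivatives(R + 0.5 * dz * k1[0], S + 0.5 * dz * k1[1], coupling, field_factor)
        k3 = derivatives(R + 0.5 * dz * k2[0], S + 0.5 * dz * k2[1], coupling, field_factor)
        k4 = derivatives(R + dz * k3[0], S + dz * k3[1], coupling, field_factor)
        R += dz * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6.0
        S += dz * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6.0
    return R, S


# Arbitrary units; a purely imaginary field factor is a 90 degree grating phase shift.
R_out, S_out = march(1.0, 0.01, coupling=1.0, field_factor=2.0j, thickness=0.5)
print(f"signal-to-pump intensity ratio at exit: {abs(S_out) ** 2 / abs(R_out) ** 2:.3e}")
```

In this toy model the signal-to-pump intensity ratio grows as exp(Γz) with Γ = 4 × coupling × |field_factor| when the grating is shifted by 90°, whereas a real (unshifted) field factor changes only the beam phases. This mirrors the distinction drawn in the text between moving and stationary gratings.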

III. Discussion of Numerical Solutions

Having defined the model, let us now turn to a study of
representative solutions. Numerical solutions given in
Figs. 7-16 are explored in Section III-A. To understand aspects
of these solutions, the diffraction anisotropies inherent in
sillenite crystals in the absence of optical activity are
reviewed in Section III-B. Finally, a heuristic argument is
proposed in Section III-C to explain some of the more striking
aspects of the numerical solutions.

A. Numerical Solutions

Diffraction and polarization properties of a number of recording
configurations have been studied numerically, with results as
shown in Figs. 7-16. Configuration parameters include the crystal
orientation, the applied bias field, the grating spatial
frequency, the assumption of either a stationary or a
Doppler-shifted optical interference pattern, and the
incorporation of either a self-diffraction or a uniform grating
recording model. Two alternative combinations of bias field and
grating spatial frequency have been considered as representative
of different solution limits. One combination consists of a bias
electric field of 6 kV/cm and a grating spatial frequency of
300 cycles/mm. This bias field is high enough to enhance
significantly the photosensitivity, and the grating spatial
frequency is high enough to allow for excellent resolution in
any reconstructed volume hologram. In the second combination,
the bias electric field is increased to 10 kV/cm, which is about
the upper limit for BSO in practical applications. The grating
spatial frequency has been reduced from 300 to 70 cycles/mm,
even though this reduces the holographic resolution capacity,
because at this spatial frequency the resulting space charge
field for the case of an optimized velocity moving grating is at
a maximum for a 10 kV/cm bias field, as shown in Fig. 6. The
second combination of parameters gives almost three times the
coupling strength of the first configuration, for the case of
Doppler-enhanced recording. The space charge field factors
resulting from these two combinations of parameters, assuming
the material parameters given in Table I and the analytic
solutions published by Refregier et al. [10], are shown in
Table II. The phase angles quoted in Table II give the phase of
the space charge field relative to the optical interference
pattern in each case. In addition, an incident intensity ratio
(signal to pump beam) of 10^-[illegible] has been assumed
throughout to assure applicability of the undepleted pump
approximation. It can be expected that similar effects will
occur for higher incident intensity ratios, but modification of
the photorefractive charge transport model to incorporate the
effects of a depleted pump beam may then be necessary.

Consider first the inhomogeneities induced in the modulation
depth m by self-diffraction, which have been calculated for the
6 kV/cm bias field, 300 cycles/mm grating spatial frequency
recording combination, with results as shown in Figs. 4 and 5.
Fig. 4 shows the photorefractive grating modulation depths as a
function of grating thickness calculated for both orientations,
as well as for both running and stationary gratings. For the
case of stationary gratings, self-diffraction yields significant
oscillatory behavior in the {K_G ∥ <001>} orientation, but only
minor perturbations in the {K_G ⊥ <001>} orientation. On the
other hand, in the case of running gratings, both orientations
yield striking modulation effects. In particular, the
{K_G ⊥ <001>} configuration exhibits a periodic layered
structure in the modulation depth, with regions at particular
grating thicknesses that approach zero modulation. The
photorefractive grating phase angle is plotted for the same set
of cases in Fig. 5. For both orientations, the phase is fixed
throughout in the case of running gratings at the optimum
velocity. For stationary gratings, the {K_G ⊥ <001>}
orientation exhibits an oscillatory phase behavior throughout
the depth, while the {K_G ∥ <001>} orientation exhibits a
nonlinear warping of the phase fronts with an additional
component of periodic behavior. Note then that the
{K_G ⊥ <001>} orientation in the case of stationary gratings
has essentially constant modulation but oscillatory phase, while
the same orientation in the case of running gratings has
constant phase but oscillatory modulation.

Similar profiles have been calculated for the 10 kV/cm,
70 cycles/mm combination, with results almost exactly like those
shown in Figs. 4 and 5. The phase modulation is almost identical
to that shown in Fig. 5, with only a very slight increase for the
larger coupling case. The amplitude growth has the same general
form as shown in Fig. 4, but spans a larger range of values.

Next consider the polarization state evolution results shown in
Figs. 7-14. The advance of the polarization angle for the
{K_G ∥ <001>} orientation, shown in Figs. 7 and 9, predominantly
follows a linear trend defined by the magnitude of the optical
activity, with a spatially periodic oscillation about this linear
progression. The periodic oscillation is induced by coupling of
the signal beam to the pump beam, which undergoes its own
spatially periodic evolution in polarization state. In the absence
of an applied bias field, this spatial period is defined by
d = 180°/ρ, in which ρ is the optical rotatory power. In the
presence of a bias-field-induced linear birefringence C, the
spatial period is somewhat smaller, and is defined by
d = 180°/(ρ² + C²)^(1/2). The periodic perturbation is strongest
for Doppler-enhanced self-diffraction, and weakest for the uniform
space charge grating model. The evolution of the ellipticity for
the {K_G ∥ <001>} orientation, shown in Figs. 8 and 10, also
exhibits the same periodicity. (Ellipticity is defined as the
ratio of the minor to the major axes of the polarization ellipse,
with opposite signs for right-handed and left-handed electric
vector rotation.) The ellipticity for the Doppler-enhanced
self-diffraction model of the signal beam deviates most strongly
from that of the pump beam.
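The zero-field behavior of the pump beam described above can be illustrated with a short Jones-vector sketch. It is only an illustration under stated assumptions: pure optical rotation with no bias field and no coupling, the rotatory power of 38.6°/mm quoted elsewhere in this paper, and a handedness sign convention chosen here for convenience. The function names are hypothetical and do not come from the numerical model used for Figs. 7-14.

```python
import math

RHO_DEG_PER_MM = 38.6  # optical rotatory power quoted in the text


def rotation_period_mm(rho_deg_per_mm=RHO_DEG_PER_MM):
    """Zero-field spatial period d = 180 deg / rho."""
    return 180.0 / rho_deg_per_mm


def rotate(jones, z_mm, rho_deg_per_mm=RHO_DEG_PER_MM):
    """Apply pure optical rotation over a path z_mm to a Jones vector (Ex, Ey).

    The sense of rotation is an assumption of this sketch.
    """
    a = math.radians(rho_deg_per_mm * z_mm)
    ex, ey = jones
    return (math.cos(a) * ex - math.sin(a) * ey,
            math.sin(a) * ex + math.cos(a) * ey)


def angle_and_ellipticity(jones):
    """Return (major-axis angle in degrees, minor/major axis ratio with sign)."""
    ex, ey = complex(jones[0]), complex(jones[1])
    s0 = abs(ex) ** 2 + abs(ey) ** 2
    s1 = abs(ex) ** 2 - abs(ey) ** 2
    cross = ex.conjugate() * ey
    s2 = 2.0 * cross.real
    s3 = 2.0 * cross.imag  # sign (handedness) convention is an assumption
    psi = 0.5 * math.atan2(s2, s1)
    chi = 0.5 * math.asin(max(-1.0, min(1.0, s3 / s0)))
    return math.degrees(psi) % 180.0, math.tan(chi)


if __name__ == "__main__":
    d = rotation_period_mm()
    print("zero-field period (mm):", round(d, 3))
    print("state after d/2:", angle_and_ellipticity(rotate((1.0, 0.0), d / 2)))
```

For linearly polarized input the ellipticity stays at zero and the polarization angle advances linearly with depth; the oscillations discussed above arise only once the bias field and beam coupling are included, which this sketch deliberately omits.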

The {K_G ⊥ <001>} orientation exhibits even more striking
modulation of the signal beam polarization state by
self-diffraction effects, at least for Doppler-enhanced coupling,
as shown in Figs. 11-14. Note the strong variations in the
polarization angle for the Doppler-enhanced self-diffracted
signal, as shown in Figs. 11 and 13. The stationary grating model
with self-diffraction exhibits a polarization response which is
much closer to that shown by the uniform grating model. The
ellipticity evolution for the Doppler-enhanced signal beam also
differs remarkably from that of the stationary grating and uniform
grating models, as seen in Figs. 12 and 14.
Fig. 7. Evolution of the polarization angle (major axis
orientation) for diffraction in the {K_G ∥ <001>} orientation for
the undepleted pump beam (solid line), signal beam with
Doppler-enhanced self-diffraction at the optimum velocity (dashed
line), signal beam with a stationary grating and self-diffraction
(single dashed line), and signal beam without self-diffraction
assuming a phase grating uniform in magnitude and phase throughout
the holographic volume (double dashed line). The pump and signal
beams are assumed to enter the crystal with identical linear
polarization states, with polarization angle parallel to the
grating wave vector and the applied bias field.


Fig. 8. Evolution of the ellipticity for diffraction in the
{K_G ∥ <001>} orientation for the same recording conditions
considered in Fig. 7.

Fig. 9. Evolution of the polarization angle for diffraction in the
{K_G ∥ <001>} orientation, as shown in Fig. 7, but for a higher
bias field of 10 kV/cm and a lower grating spatial frequency of
70 cycles/mm, which combine to give almost a threefold increase in
the coupling strength.


Fig. 10. Evolution of the ellipticity for diffraction in the
{K_G ∥ <001>} orientation for the same recording conditions
considered in Fig. 9.


Fig. 11. Evolution of the polarization angle for recording
conditions considered in Fig. 7, but for the orthogonal
{K_G ⊥ <001>} recording orientation.


Fig. 12. Evolution of the ellipticity for recording conditions
considered in Fig. 11.


Fig. 13. Evolution of the polarization angle for recording
conditions considered in Fig. 9, but for the {K_G ⊥ <001>}
orientation.


Fig. 14. Evolution of the ellipticity for recording conditions
considered in Fig. 13.


Fig. 15. Degradation of energy coupling gain Γ (as defined in
[10]) with increasing levels of optical rotation ρd, in which ρ is
the optical rotatory power and d is the crystal thickness, for the
two principal recording orientations. These curves are based upon
analytic solutions derived by Foote and Hall for diffusion regime
recording [30].


Fig. 16. Degradation of energy coupling with increasing levels of
optical rotation ρd for Doppler-enhanced drift-regime recording.
Recording conditions are identical to those assumed in Figs. 7
and 11.


Figs. 15 and 16 show the degradation in energy-coupling gain with
increasing amounts of optical rotation ρd in the diffusion and
drift regimes, respectively, for the two principal
crystallographic orientations. The results shown in Fig. 15 are
derived from analytic solutions presented by Foote and Hall [30],
while the results shown in Fig. 16 are numerically derived from
the coupled wave equations. In both figures, the pump and signal
beams are assumed to enter the photorefractive medium with
identical linear polarization states. In each case, the incident
polarization angle is varied from 0 to 180° to find the minimum
and the maximum coupling gain; the optimization of the
polarization angle is repeated for each value of ρd. These two
figures were derived by keeping the optical rotatory power ρ fixed
at 38.6°/mm and varying the crystal thickness d, but very similar
curves have been obtained by keeping the thickness d constant and
varying the rotatory power ρ. These curves have important
implications for holographic experiments which measure the
effective electrooptic coefficient. These measurements are
difficult to interpret correctly for a number of reasons, one
being the intricate polarization properties of volume gratings in
sillenite crystals. Figs. 15 and 16 indicate that a 50 percent
reduction in coupling gain can easily be accounted for because of
degradation induced by optical activity.

We have also explored the impact on energy coupling gain of using
nonidentically polarized incident pump and signal beams, and of
using incident beams with other than linearly polarized states. A
modest increase in the gain of order :-!0 percent has been
obtained for particular configurations.

A heuristic argument is presented in Section III-C to explain some
of the behavior shown in Figs. 7-16. First, however, we need to
review the fundamental anisotropies in the diffraction process for
the two principal orientations of sillenite crystals typically
employed for volume holography.

B. Inherent Diffraction Anisotropies

Much of the interesting polarization evolution behavior exhibited
in Figs. 7-16 derives from the anisotropies inherent in the light
diffraction process, independent of optical activity. When the
optical activity terms are set equal to zero, the sets of coupled
wave equations with self-diffraction effects included can be
completely decoupled into orthogonal polarization eigenmodes, and
analytic solutions can be obtained for each eigenmode. The
eigenaxes are different for the two principal recording
configurations, and are shown by the (X_EO, Y_EO, Z_EO) axes in
Figs. 1 and 2.

For example, the {K_G ∥ <001>} orientation shown in Fig. 1 has
eigenaxes X_EO and Y_EO parallel and perpendicular to the grating
wave vector and the applied electric field, respectively. No
energy coupling is exhibited for light polarized along the X_EO
eigenaxis. Instead, maximum coupling occurs for light polarized
along the orthogonal axis, Y_EO in Fig. 1. This coupling can be
either positive or negative, depending upon the direction of the
applied electric field. We assume throughout this paper that the
field direction is oriented for positive coupling.

The {K_G ⊥ <001>} orientation has its two eigenaxes X_EO and Y_EO
oriented at 45° with respect to the grating wave vector and the
applied bias field, as shown in Fig. 2. Positive energy coupling
gain is exhibited along one of these eigenaxes, and negative
energy coupling gain is exhibited along the orthogonal axis,
although the identification of which axis exhibits positive gain
can be altered by reversing the direction of the applied bias
field. This can be understood with reference to the distortion of
the index ellipsoid imposed by a uniform bias electric field. In
the absence of any electric field, the index is isotropic, so that
the constant index surface is a sphere. A uniform bias electric
field distorts this index surface into an ellipsoid with principal
axes aligned along the (X_EO, Y_EO, Z_EO) eigenaxes shown in
Fig. 2. The index of refraction is increased by a small amount Δn
along one of the axes, say the X_EO axis, while it is decreased
along the Y_EO axis. Now consider a sinusoidal space charge field,
which induces a corresponding sinusoidal phase grating for light
polarized along the X_EO eigenaxis, and a similar phase grating
for light polarized along the Y_EO eigenaxis. Note, however, that
the grating induced along the Y_EO eigenaxis is 180° out of phase
with respect to the grating induced along the X_EO eigenaxis,
because the index shift along the one axis is equal and opposite
to the index shift along the orthogonal axis. Therefore, if one of
the two gratings is optimally phased for positive energy coupling,
then the second grating must be phased for negative energy
coupling.

The {K_G ⊥ <001>} orientation exhibits even more interesting
polarization behavior for light linearly polarized along
directions bisecting the eigenaxes, i.e., either parallel or
perpendicular to the grating wave vector and the applied bias
field. For light so polarized, the volume grating exhibits
birefringent diffraction properties, also known in the literature
as anisotropic Bragg diffraction, wherein the diffracted light has
polarization orthogonal to the incident light [29]. This
birefringent mode is exhibited only by the {K_G ⊥ <001>}
orientation, not by the {K_G ∥ <001>} orientation, and has
important implications for the polarization properties in the
presence of self-diffraction, as explained next.

C. Comparison of the Two Principal Orientations

A critical insight into the light diffraction process in the
presence of self-diffraction is that the space charge grating can
only be recorded where the pump and signal beams share the same
polarization state, not when they have orthogonal states.
Therefore, an interesting aspect to study in the numerical
solutions is the comparative content of signal light intensity
with polarization state parallel to and orthogonal to that of the
local pump beam. Note the use of the term "local": the
polarization state of the pump beam continuously evolves as it
propagates through the crystal, so that the signal beam
polarization decomposition of interest similarly evolves.
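The decomposition described above amounts to projecting the signal beam Jones vector onto the local pump polarization state. A minimal sketch follows, assuming both beams are represented by two-component complex Jones vectors at a given depth; the function name and the example vectors are illustrative only and are not taken from the numerical model.

```python
def coherent_incoherent(signal, pump):
    """Split the signal intensity into the part sharing the pump polarization
    state (able to write a grating) and the orthogonal remainder."""
    pump = [complex(c) for c in pump]
    signal = [complex(c) for c in signal]
    pump_norm = sum(abs(c) ** 2 for c in pump)
    total = sum(abs(c) ** 2 for c in signal)
    # Inner product <pump|signal>; conjugation is on the pump components.
    overlap = sum(p.conjugate() * s for p, s in zip(pump, signal))
    coherent = abs(overlap) ** 2 / pump_norm
    incoherent = total - coherent
    return coherent, incoherent


if __name__ == "__main__":
    pump = (1.0, 0.0)                    # pump linearly polarized along x
    print(coherent_incoherent((1.0, 0.0), pump))          # fully coherent
    print(coherent_incoherent((0.0, 1.0), pump))          # fully incoherent
    print(coherent_incoherent((2 ** -0.5, 2 ** -0.5), pump))  # equal split
```

Because the pump state evolves with depth, the projection would have to be repeated at each propagation step using the pump vector at that depth.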

Representative numerical solutions for these two polarization
components are shown in Fig. 17 for the {K_G ∥ <001>} orientation
and in Fig. 18 for the {K_G ⊥ <001>} orientation. The two
polarization components in these figures are labeled "coherent"
and "incoherent," referring to the components with polarization
states identical to and orthogonal to the local pump beam,
respectively. Note that the coherent component dominates
throughout the propagation path for the {K_G ∥ <001>} orientation,
as shown in Fig. 17. Recall that this orientation exhibited no
birefringent diffraction modes in the absence of optical activity.
Addition of optical activity introduces only a small amount of
birefringent diffraction. Compare this with the response for the
{K_G ⊥ <001>} orientation shown in Fig. 18. Assuming that the
incident beams are polarized to maximize the energy coupling
(which implies different incident polarizations for the two
configurations), the growth of the coherent component is identical
for both recording orientations, at least for the first half
millimeter or so of propagation. For the {K_G ⊥ <001>}
orientation, the optical activity rotates the pump polarization
angle so that after the first millimeter or so of propagation
significant birefringent diffraction is encountered. At this
point, further growth of the coherent signal light is halted,
while the incoherent component continues to grow. One consequence
is that further growth of the space charge field which supports
the diffraction is similarly halted. After more propagation, say
at about the
Fig. 17. Growth of the incoherent and coherent signal beam
components for self-diffraction in the {K_G ∥ <001>} orientation.
The coherent component is defined as that which has a polarization
state identical to the local pump beam, and hence can interfere
coherently with the pump beam to form a fringe pattern and
photorefractively induce a space charge field. The incoherent
component has an orthogonal polarization state, and hence can only
erase the space charge field. Recording conditions are identical
to those assumed in Figs. 7 and 8, except that the incident
polarization angle is 90° with respect to the grating wave vector
to maximize the energy coupling.


Fig. 18. Growth of the incoherent and coherent signal beam
components for self-diffraction in the {K_G ⊥ <001>} orientation.
Recording conditions are identical to those assumed in Figs. 11
and 12, except that the incident polarization angle is -45° with
respect to the grating wave vector to maximize the energy
coupling.

2 mm depth, the optical activity has rotated the pump beam
polarization enough so that now significant negative energy
coupling is experienced, and the coherent signal component loses
intensity rather than gains in this region. No comparable
birefringent modes or negative gain axes exist for the
{K_G ∥ <001>} orientation. This explains why the {K_G ∥ <001>}
orientation exhibits superior energy-coupling gain compared with
the {K_G ⊥ <001>} orientation, as has been reported in the
literature [31].

IV.  Conclusions 

A method for calculating the polarization state evolution in
photorefractive sillenite crystals in the presence of
self-diffraction effects has been described, and representative
solutions have been studied. Self-diffraction effects are shown to
alter the diffraction process remarkably. For the {K_G ∥ <001>}
orientation, self-diffraction induces significant energy coupling
gain; for the {K_G ⊥ <001>} orientation, self-diffraction
significantly modifies the polarization states, especially for
Doppler-enhanced recording. Striking spatial inhomogeneities are
induced in the space charge field, especially for the
{K_G ⊥ <001>} orientation, leading to striations in the space
charge field along the direction of propagation for the case of
Doppler-enhanced recording, and essentially constant amplitude but
corrugated phase fronts for stationary recording. It is
interesting to note that the complexity and time to compute for
the numerical model are only modestly increased with the addition
of self-diffraction, at least for the linear photorefractive
recording model in steady state used herein. The diffraction
properties of the two principal recording configurations prove to
be distinctly different, and in part this difference is attributed
to fundamental anisotropies inherent in the diffraction process.
Finally, the degradation in energy coupling gain with increasing
amounts of optical rotation ρd has been calculated for a few
representative cases. This degradation can be of order 50 percent
and must be considered in any holographically-based measurements
of effective electrooptic coefficients.
A computational algorithm for analyzing diffraction properties of optical devices, the optical beam propagation
method, has suggested a new class of devices by which Bragg regime (thick grating) response can be obtained from a
spaced sequence of thin grating layers. Such stratified volume holographic optical elements (SVHOE's) can
emulate distributed volume gratings in terms of diffraction efficiency and angular selectivity and in addition possess
periodic diffraction properties that might serve, for example, as interconnections for optical cellular logic arrays.
SVHOE's also offer a unique capability for altering the device diffraction response on a layer-by-layer basis,
allowing for control of both the diffraction peak width and the angular separation of adjacent peaks.


Holographic optical elements are of fundamental importance to a
number of applications in optical information processing and
computing, including optical interconnections, content-addressable
memories, and various linear and nonlinear signal processing
configurations. Numerical analyses of typical volume holographic
structures are critical for characterizing and optimizing such
optical elements. One of the most flexible and broadly applicable
numerical tools for studying the diffraction behavior of volume
holograms is the optical beam propagation method (BPM). In this
method, the distributed optical inhomogeneities that characterize
a typical hologram are approximated by a discrete sequence of
physically and mathematically simplified elements: infinitesimally
thin phase and/or polarization modulation layers, interleaved with
optically homogeneous layers of finite thickness. By approximating
the distributed grating with a sufficiently large number of these
discrete elements, the resulting numerical model of the grating
can be brought arbitrarily close in its response to that of the
desired distributed bulk grating.

The  BPM  concept  of  separating  the  modulation 
process  from  the  diffraction  process  in  turn  suggests  a 
new  class  of  optical  devices:  the  stratified  volume 
holographic optical element, or SVHOE, as shown in
Fig.  1.  The  SVHOE  device  structure  consists  of  a 
sequence  of  thin  photosensitive  holographic  recording 
layers  that  perform  the  optical  modulation  function, 
interleaved  with  optically  passive  buffer  layers,  i.e., 
layers  that  impress  no  modulation  on  the  light  beam 
but  rather  allow  the  diffraction  processes  necessary  for 
thick grating response to occur. A grating recorded in
any  individual  modulation  layer  would  necessarily  ex¬ 
hibit  Raman-Nath  characteristics  because  each  re¬ 
cording  layer  is  quite  thin.  However,  a  surprisingly 
small  number  of  recording  layers,  each  spaced  from  its 
neighbors  by  a  passive  layer  of  appropriate  thickness, 
can  exhibit  pronounced  Bragg  regime  (thick  grating) 
response.  In  addition,  SVHOE  structures  exhibit 
striking  diffraction  characteristics  not  observed  in 
bulk  gratings,  as  demonstrated  below. 

The SVHOE structure is of particular interest for materials that
can be produced in only finite thickness layers but that
nonetheless exhibit either strong modulation effects or fast
response times. Examples of such strongly modulating but
limited-thickness media include III-V and II-VI compound
semiconductor multiple quantum wells and superlattices, and
ordered noncentrosymmetric polymer layers characterized by
significant electro-optic coefficients.

For  simplicity,  the  following  discussion  emphasizes 
the  angular  alignment  sensitivity  of  the  grating  struc¬ 
ture;  alternative  performance  measures,  such  as  sensi¬ 
tivity  to  the  wavelength  of  the  readout  light,  corre¬ 
spond  closely  to  this  angular  alignment  sensitivity 
metric. To measure the angular alignment response,
a  grating  structure  is  illuminated  with  a  well-colli¬ 
mated  single-wavelength  laser  beam.  That  fraction  of 
the  incident  light  intensity  diffracted  into  an  adjacent 
diffraction  order,  such  as  the  +1  order,  is  measured 
with  a  photodetector  as  a  function  of  the  tilt  angle  of 
the  grating. 

The  angular  response  for  a  representative  distribut¬ 
ed  bulk  grating  is  shown  in  the  top  half  of  Fig.  2  to 
serve  as  a  reference  for  comparison  with  sample  re¬ 
sponse  curves  for  typical  SVHOE  structures.  Such  a 
uniform  volume  (or  distributed  bulk)  grating  has  an 
alignment sensitivity curve characterized by essentially
a sinc² profile. The angular width of this response


Fig.  1.  The  SVHOE  structure,  showing  the  recording  of  a 
grating  with  wave  vector  Ko  by  means  of  two  coherent, 
collimated  recording  beams. 
curve is inversely proportional to the grating thickness: a thin
grating has a broad response curve indicative of Raman-Nath regime
behavior, whereas a thick grating has a quite narrow response
curve indicative of Bragg regime behavior.
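The inverse dependence of angular width on thickness can be illustrated with a simple sketch of the sinc² profile. This is an assumption-laden approximation, not the BPM calculation behind Fig. 2: it takes the weak-grating limit for an unslanted transmission grating, measures the detuning angle inside the medium, and places the first null at a detuning equal to the ratio of grating period to thickness. The function names and the sample values are hypothetical.

```python
import math


def bulk_response(delta_theta_rad, thickness_mm, period_mm):
    """Normalized sinc^2 angular response of a weak uniform volume grating."""
    u = thickness_mm * delta_theta_rad / period_mm
    if u == 0.0:
        return 1.0
    return (math.sin(math.pi * u) / (math.pi * u)) ** 2


def first_null_rad(thickness_mm, period_mm):
    """Angular detuning of the first null; scales as 1 / thickness."""
    return period_mm / thickness_mm


if __name__ == "__main__":
    period = 1.0 / 350.0  # mm, an illustrative grating period
    for thickness in (0.01, 0.1, 1.0):  # mm: thin to thick
        print(thickness, "mm ->", first_null_rad(thickness, period), "rad")
```

A tenfold increase in thickness narrows the central lobe tenfold, which is the transition from broad Raman-Nath-like response to narrow Bragg response described above.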

Comparative  angular  response  curves  for  several 
SVHOE  structures  are  shown  in  Figs.  2  and  3.  In 
these  figures,  the  recording  layers  in  each  thin  grating 
stack  are  assumed  for  simplicity  to  have  infinitesimal 
thickness  and  to  impose  only  phase  (as  opposed  to 
amplitude)  modulation.  When  the  structure  com¬ 
prises  several  thin  phase  modulation  layers,  the  sum  of 
the  phase  modulation  for  all  layers  has  been  set  equal 
to  that  of  the  reference  case  of  the  uniform  volume 
grating  (Fig.  2,  top)  and  is  divided  equally  among  all 
the  thin  recording  layers.  The  thickness  of  each  opti¬ 
cally  passive  buffer  layer  is  adjusted  such  that  the 
total  device  thickness  is  equal  to  that  of  the  thick 
uniform  volume  grating.  Finally,  the  index  of  refrac¬ 
tion  of  the  buffer  layers  has  been  set  equal  to  the 
average  index  of  the  modulation  layers. 


Fig.  2.  Angular  response  for  a  distributed  thick  grating 
(top),  for  a  single  thin  grating  (horizontal  line,  bottom),  and 
for  a  pair  of  thin  gratings  (bottom). 


Consider  first  a  single  thin  grating  with  modulation 
strength  equal  to  that  of  the  bulk  grating.  Such  a  thin 
grating  has  negligible  angular  alignment  sensitivity,  as 
shown by the horizontal line in the bottom half of Fig.
2.  The  angular  alignment  sensitivity  of  a  two-grating 
structure,  however,  exhibits  essentially  a  sinusoidal 
oscillation,  as  shown  in  the  bottom  half  of  Fig.  2,  with 
an  angular  period  defined  by  the  ratio  of  the  grating 
period  to  the  thickness  separating  the  two  gratings. 
(Note  that  this  structure  has  potentially  interesting 
applications  as  a  high-resolution  angle  encoder.) 

A  stack  of  five  thin  gratings,  as  shown  in  Fig.  3, 
suppresses  three  of  every  four  peaks  that  appear  when 
only  the  two  outer  gratings  are  present.  In  general,  a 
stack  of  N  thin  gratings  exhibits  an  angular  alignment 
sensitivity that is periodic, with a period of N - 1 two-
grating peaks, in which N - 2 of the two-grating peaks
have  been  suppressed  by  the  interior  gratings. 
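The peak-suppression rule can be checked with a simple coherent-sum sketch. It assumes weak, infinitesimally thin, equally spaced gratings whose first-order contributions add with a fixed phase step between neighboring layers (a single-scattering approximation, not the BPM model used for Figs. 2 and 3). The function names are hypothetical, and `phi_total` denotes the phase mismatch accumulated between the two outer gratings, so that the two-grating peaks fall at integer multiples of 2π.

```python
import cmath
import math


def stack_response(n_gratings, phi_total):
    """Normalized first-order response of n equally spaced weak thin gratings."""
    step = phi_total / (n_gratings - 1)  # phase step between adjacent layers
    amplitude = sum(cmath.exp(1j * k * step) for k in range(n_gratings))
    return abs(amplitude) ** 2 / n_gratings ** 2


def surviving_peaks(n_gratings, n_two_grating_peaks, threshold=0.5):
    """Indices j of two-grating peaks (phi_total = 2*pi*j) that remain strong."""
    return [j for j in range(n_two_grating_peaks + 1)
            if stack_response(n_gratings, 2.0 * math.pi * j) > threshold]


if __name__ == "__main__":
    print(surviving_peaks(2, 8))  # every two-grating peak is present
    print(surviving_peaks(5, 8))  # only every fourth peak survives
```

In this approximation the suppressed peaks of the five-grating stack are reduced to a few percent of the principal peaks rather than to exactly zero.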

For a given angular interval, the incorporation of a
sufficiently large number of thin gratings leads to a
structure with an angular alignment sensitivity that is
indistinguishable  from  that  of  the  bulk  grating.  Thus 
a  SVHOE  structure  can  emulate  closely  the  Bragg 
response  of  a  bulk  grating,  using  surprisingly  few  thin 
grating  layers.  In  addition,  this  structure  exhibits 
features  not  found  in  bulk  gratings  that  may  prove 
useful  for  certain  system  applications.  For  example, 
illumination  of  a  SVHOE  structure  with  focused 
monochromatic light produces uniformly spaced angular
response peaks, which can be transformed with a
lens  to  produce  an  array  of  equally  spaced  points. 
This  function  is  useful  for  interconnection  of  optical 
cellular  logic  arrays.  For  example,  illumination  with 
850-nm  light  of  a  two-grating  SVHOE  with  a  grating 
separation  of  1  cm  and  a  grating  spatial  frequency  of 
350  cycles/mm  at  F/3.3  will  produce  in  excess  of  1000 
regularly  spaced  diffracted  beams. 
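The order of magnitude of this beam count can be reproduced with a short estimate. The sketch assumes that the angular period of the two-grating response equals the grating period divided by the grating separation (as stated earlier for the two-grating structure), that the focused illumination fills a cone of full angle 2·arctan(1/(2F)), and that refraction at the surfaces is ignored. The function names are hypothetical, and the wavelength does not enter at this level of approximation.

```python
import math


def two_grating_angular_period_rad(cycles_per_mm, separation_mm):
    """Angular period = grating period / grating separation."""
    return (1.0 / cycles_per_mm) / separation_mm


def cone_full_angle_rad(f_number):
    """Full cone angle of a focused beam with the given F-number."""
    return 2.0 * math.atan(1.0 / (2.0 * f_number))


def beam_count(f_number, cycles_per_mm, separation_mm):
    """Approximate number of response peaks falling within the cone."""
    return cone_full_angle_rad(f_number) / two_grating_angular_period_rad(
        cycles_per_mm, separation_mm)


if __name__ == "__main__":
    # 350 cycles/mm, 1 cm separation, F/3.3 as in the example above
    print(beam_count(3.3, 350.0, 10.0))
```

With these inputs the estimate comes out at roughly 1050 peaks, consistent with the figure of more than 1000 beams quoted above.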

Further, for real-time material implementations of SVHOE
structures in which the photoresponse for grating formation can be
either optically or electrically switched on a layer-by-layer
basis, a structure with a given number of layers can interconnect
either every element or every pth element with the source, in a
programmable manner. For example, such layer-by-layer
programmability may prove feasible by either optical bleaching or
electrical tuning of the exciton resonance in a multiple quantum
well structure or by modulation of a doping superlattice. SVHOE
structures can also be conceived that exhibit entirely different
diffraction behavior in response to grating formation by distinct
writing wavelengths, by actively altering the layer
photosensitivity spectrum on a layer-by-layer basis. These
features collectively allow for additional degrees of freedom in
the formulation of both passive and active holographic elements.

Several  simple  experiments  have  been  performed  to 
demonstrate  and  verify  the  SVHOE  concept.  Phase 
gratings  have  been  recorded  in  positive  photoresist 
(Shipley  1450J)  deposited  upon  microscope  cover 
glass pieces (22 mm × 22 mm, approximately 200 µm
thick).  The  grating  frequency  was  adjusted  to  be  74 
cycles/mm,  as  recorded  by  a  442-nm  laser  interferome¬ 
ter.  Readout  was  performed  by  a  633-nm  laser  beam, 
which  probed  either  individual  gratings  or  stacks  of 
Fig. 4. Angular response for three gratings, each spaced
from  the  next  by  a  single  glass  plate  thickness:  experiment 
(solid  curve)  and  computer  model  (dashed  curve). 


gratings.  Gratings  were  exposed  individually,  pro¬ 
cessed,  and  then  assembled  using  moire  imaging  tech¬ 
niques  to  achieve  proper  alignment  both  in  angle  and 
in  linear  translation  with  respect  to  each  other. 

The angular response has been measured and compared with
theoretical calculations for the following cases: (1) a single
grating, (2) two gratings separated by 200 µm (the thickness of a
single glass plate), (3) two gratings separated by 400 µm, (4) two
gratings separated by 800 µm, and (5) three gratings, each
separated from the next by 200 µm, for a total device thickness of
400 µm. In all cases, the agreement between experiment and theory
is excellent. Representative experimental and theoretical curves
for the three-grating case are shown in Fig. 4.

The peak diffraction efficiency of a SVHOE structure
is a function not only of the modulation strength
of each grating layer but also of the spacing between
the layers. A single thin phase grating can diffract no
more than 33.9% of the incident readout light power
into the +1st diffraction order, assuming a sinusoidal
grating profile. A two-grating structure can diffract
as much as 67.7%, as discussed by Zel’dovich et al.,^ or
as little as 22.2%, assuming the same grating strength
for each layer but changing only the buffer layer thickness.
A three-grating structure can diffract as much
as 87.0%, a four-grating structure in excess of 90%, and
a five-grating structure more than 95%, if the buffer-layer
thickness is chosen to be optimum in each case.
Maximum diffraction efficiency (at least for grating
strengths ranging from zero to the first maximum in
diffraction efficiency) occurs when the thickness of
each buffer layer is given by

$$Q_{\mathrm{buffer}} \equiv \frac{2\pi\lambda D_{\mathrm{buffer}}}{n\Lambda^{2}} = 2\pi\left(m + \tfrac{1}{2}\right), \tag{1}$$

in which $\lambda$ is the light wavelength, $D_{\mathrm{buffer}}$ is the
buffer-layer thickness, $n$ is the index of refraction of the
buffer layers, $\Lambda$ is the period of the grating, and $m$ is an
integer. Thus, neglecting the thickness of individual
grating layers and counting only the buffer layers, the
optimum total thickness $D_{\mathrm{total}}$ for a structure with $N$
gratings is given by

$$Q_{\mathrm{total}} \equiv \frac{2\pi\lambda D_{\mathrm{total}}}{n\Lambda^{2}} = 2\pi\left(m + \tfrac{1}{2}\right)(N - 1). \tag{2}$$

Note that the broadest spatial-frequency response occurs
when the smallest optimal thickness is employed,
i.e., when $Q_{\mathrm{total}} = (N - 1)\pi$.
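
As a rough numerical check of the figures above, the following sketch evaluates the Q parameter of Eq. (1) for the experimental geometry (633-nm readout, 74 cycles/mm, 200-µm plates) and the 33.9% ceiling for a single thin sinusoidal phase grating, taken here as the maximum of the squared first-order Bessel function. The cover-glass index of 1.5 and all function names are assumptions made for illustration; they do not appear in the original text.

```python
import math

def q_parameter(wavelength, thickness, index, period):
    """Q = 2*pi*lambda*D / (n*Lambda^2), the quantity used in Eqs. (1) and (2)."""
    return 2.0 * math.pi * wavelength * thickness / (index * period ** 2)

def optimum_buffer_thickness(wavelength, index, period, m=0):
    """Buffer thickness D that satisfies Q_buffer = 2*pi*(m + 1/2)."""
    return (m + 0.5) * index * period ** 2 / wavelength

def bessel_j1(x, terms=25):
    """Power-series evaluation of J1(x); adequate for the small arguments used here."""
    return sum(
        (-1) ** k / (math.factorial(k) * math.factorial(k + 1)) * (x / 2.0) ** (2 * k + 1)
        for k in range(terms)
    )

def max_thin_grating_efficiency(steps=4000, x_max=4.0):
    """Largest J1(phi)^2 found by scanning phase-modulation amplitudes 0..x_max."""
    return max(bessel_j1(i * x_max / steps) ** 2 for i in range(steps + 1))

READOUT_WAVELENGTH = 633e-9      # m, from the text
GRATING_PERIOD = 1e-3 / 74       # m, 74 cycles/mm from the text
PLATE_THICKNESS = 200e-6         # m, from the text
GLASS_INDEX = 1.5                # ASSUMED value for cover glass (not given in the text)

q_plate = q_parameter(READOUT_WAVELENGTH, PLATE_THICKNESS, GLASS_INDEX, GRATING_PERIOD)
d_opt = optimum_buffer_thickness(READOUT_WAVELENGTH, GLASS_INDEX, GRATING_PERIOD)
eta_max = max_thin_grating_efficiency()

print(f"Q for one 200-um plate: {q_plate:.2f} (pi = {math.pi:.2f})")
print(f"Optimum buffer thickness for m = 0: {d_opt * 1e6:.0f} um")
print(f"Peak +1 order efficiency of a thin sinusoidal phase grating: {eta_max:.3f}")
```

Under the assumed index, a single 200-µm plate gives a Q value close to π, which suggests that the plate spacing used in the experiments lies near the smallest optimum of Eq. (1).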

The  angular  response  analysis  shown  in  Figs.  2  and  3 
has  assumed  infinitesimally  thin  recording  layers. 
The  effect  of  finite  recording-layer  thicknesses  is  to 
introduce  an  angular  response  rolloff  associated  with 
any  single  layer,  which  in  turn  serves  as  an  envelope 
function  to  modulate  the  angular  response  of  more 
intricate  structures  constructed  from  multiple  layers. 
In  fact,  the  angular  response  characteristic  is  just  the 
Fourier  transform  of  the  depth  distribution  of  the 
grating  strength.®  Thus  for  certain  applications  one 
design constraint might be an upper bound on the
thickness of each recording layer to keep the overall
response rolloff envelope sufficiently broad.
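
The Fourier-transform relationship described above can be sketched for an idealized stack of identical, uniformly modulated layers. In this weak-grating picture the response is the product of an array factor (set by the layer positions) and a sinc envelope (set by the finite layer thickness). The variable called `detuning`, the function names, and the 20-µm layer thickness are illustrative assumptions; `detuning` stands for the Fourier variable conjugate to depth, which in practice varies with readout angle.

```python
import cmath
import math

def layer_envelope(detuning, layer_thickness):
    """Fourier transform of one uniform layer of finite thickness (a sinc function)."""
    x = detuning * layer_thickness / 2.0
    return 1.0 if x == 0.0 else math.sin(x) / x

def stack_response(detuning, n_layers, spacing, layer_thickness=0.0):
    """Normalized amplitude response of n_layers identical layers spaced by `spacing`.

    Weak-grating sketch: the array factor from the layer positions is
    multiplied by the single-layer envelope.
    """
    array_factor = sum(
        cmath.exp(1j * detuning * k * spacing) for k in range(n_layers)
    ) / n_layers
    return abs(array_factor) * abs(layer_envelope(detuning, layer_thickness))

SPACING = 200e-6                          # m, layer-to-layer spacing from the text
FIRST_SIDE_PEAK = 2.0 * math.pi / SPACING  # detuning at which the array factor repeats

thin_peak = stack_response(FIRST_SIDE_PEAK, 3, SPACING)
thick_peak = stack_response(FIRST_SIDE_PEAK, 3, SPACING, layer_thickness=20e-6)
null = stack_response(FIRST_SIDE_PEAK / 3.0, 3, SPACING)

print(f"Side peak, infinitesimally thin layers: {thin_peak:.3f}")
print(f"Side peak, 20-um layers (assumed):      {thick_peak:.3f}")
print(f"Response at the first null:             {null:.1e}")
```

For infinitesimally thin layers the side peaks repeat at full height, while a finite layer thickness lowers them through the envelope, which is the reason an upper bound on the recording-layer thickness may be needed.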

We have demonstrated that SVHOE’s can satisfactorily
emulate distributed bulk gratings over a prescribed
angular interval. However, even more intriguing
are those attributes unique to SVHOE’s that
suggest new device applications. Specifically, three
features have been identified to date: (1) the ability
to control the recording sensitivity of individual recording
layers, independent of adjacent layers, either
optically or electrically, thus altering the diffraction
characteristics of the device; (2) the ability to record
holograms with distinct diffraction characteristics at
two or more wavelengths, also through the use of electrical
or optical control on a layer-by-layer basis; and
(3) the existence of periodic multiple response peaks
in the angular response profiles.
Introduction

Over  the  past  four  decades,  the 
growth of information processing and
computational  capacity  has  been  truly 
remarkable,  paced  to  a  large  extent  by 
equally  remarkable  progress  in  the  inte¬ 
gration  and  ultra-miniaturization  of 
semiconductor  devices.  And  yet  it  is 
becoming  increasingly  apparent  that 
currently  envisioned  electronic  proces¬ 
sors  and  computers  are  rapidly 
approaching  technological  barriers  that 
delimit  processing  speed,  computa¬ 
tional  sophistication,  and  throughput 
per unit dissipated power. This realization
has in turn led to intensive efforts
to  circumvent  such  bottlenecks  through 
appropriate  advances  in  processor 
architecture,  multiprocessor  distributed 
tasking,  and  software-defined 
algorithms. 

An  alternative  strategy  that  may  yield 
significant  computational  enhance¬ 
ments  for  certain  broad  classes  of  prob¬ 
lems  involves  the  utilization  of  multi¬ 
dimensional  optical  components 
capable of modulating and/or redirecting
information-carrying light wavefronts.
Such an optical processing or
computing  approach  relies  for  its  com¬ 
petitive  advantage  principally  on  mas¬ 
sive  parallelism  in  conjunction  with 
relative  ease  of  implementation  of  com¬ 
plex  (weighted)  interconnections 
among many (perhaps simple) processing
elements. A wide range of computational
problems exist that lend themselves
quite naturally to optical
processing  architectures,  including  pat¬ 
tern  recognition,  earth  resources  data 
acquisition  and  analysis,  texture  dis¬ 
crimination,  synthetic  aperture  radar 
(SAR)  image  formation,  radar  ambiguity 
function  generation,  spread  spectrum 
identification  and  analysis,  systolic 
array  processing,  phased  array  beam 
steering,  and  artificial  (robotic)  vision. 
In  addition,  many  neural  network  pro¬ 
cesses  that  inherently  rely  on  intricate 
interconnection  patterns  have  been  or 
can  be  implemented  optically.  These 
and  other  applications  are  treated  in 
more  detail  in  accompanying  articles  in 
this  special  issue  of  the  MRS  Bulletin'-^ 
and  in  special  issues  of  IEEE  Proceed¬ 
ings^  and  Optical  Engineering*  on  optical 
computing. 

A  generalized  optical  processor  or 
computer  can  be  depicted  schematically 
as  shown  in  Figure  1.  The  physical  con¬ 
stituent  elements  of  such  a  system 
include  a  central  processing  unit  (CPU) 
that  performs  the  essential  imple- 
mentable  function,  a  data  management 
processor  that  orchestrates  the  flow  of 
data  and  sequence  of  operations  (usu¬ 
ally  considered  part  of  the  CPU  in  a  tradi¬ 
tional  electronic  computer),  several 
types  of  memory  elements  for  both 
short-term  and  long-term  data  storage 


and buffering, format devices to spatially
organize input data fields, input
devices  to  convert  data  input  types  to  a 
form  amenable  to  subsequent  process¬ 
ing,  output  devices  to  convert  pro¬ 
cessed  results  to  detectable  and  inter¬ 
pretable  forms,  and  detectors  to 
produce  externally  addressable  results. 
In  Figure  1,  feedback  interconnects  are 
explicitly  shown  as  separate  compo¬ 
nents  to  emphasize  their  crucial  role  in 
implementing  parallel  iterative 
algorithms  and  complex  weighting 
functions. 

A  wide  variety  of  optical  components 
are  required  to  implement  processors 
based  on  the  generalized  architecture 
shown  in  Figure  1.*  These  include  one- 
and  two-dimensional  spatial  light  mod¬ 
ulators,  volume  holographic  optical  ele¬ 
ments,  threshold  arrays,  optical 
memory  elements,  sources,  source 
arrays,  detectors,  and  detector  arrays. 
The  state  of  the  technology  is  such  that 
while  demonstration  devices  and  proto¬ 
types  are  proliferating,  with  the  excep¬ 
tion  of  sources,  detectors  and  detector 
arrays,  few  (if  any)  such  components 
have  as  yet  achieved  significant  com¬ 
mercial  success  or  even  demonstrated 
technological  viability.  In  large  part,  this 
is  due  to  the  fact  that  each  of  the  candi¬ 
date  technologies  has  placed  rather 
severe  demands  on  the  state-of-the-art 
of  the  materials  employed  regardless  of 
the  nature  of  the  optical  effect  utilized 
(e.g.,  electrooptic,  magnetooptic,  pho- 
torefractive,  or  electroabsorptive).  In 
other words, the magnitudes of observable
optical perturbations per unit excitation
are just not large enough with
readily available materials to allow flexible
device engineering. As such, the
answer  to  whether  optical  processing 
and  computing  will  come  of  age  may 
ultimately  rest  on  the  capabilities  and 
fortunate  discoveries  of  materials  scien¬ 
tists  and  process  engineers. 

It  is  of  considerable  interest,  nonethe¬ 
less,  to  examine  the  physical  as  well  as 
technological  limitations  that  apply  to 
optical  information  processing  and  com¬ 
puting  systems  in  order  to  assess  their 
potential  performance  advantages  (if 
any)  over  comparable  electronic  coun¬ 
terparts,  and  to  provide  much-needed 
guidance  for  continued  research  efforts 
in  optical  materials,  devices,  algorithms, 
and  system  architectures.  Although  the 
study  of  fundamental  physical  limita¬ 
tions  in  the  context  of  digital  (binary) 
electronic  systems  (particularly  VLSI)  is 
well  established,*'  it  should  be  noted 
that comparable studies of optical processing
and computing systems are relatively
recent, and have to date focused
primarily on digital optical computing as
opposed to analog optical processing.*'**
The remainder of this article will
describe several such physical and technological
limitations of both optical
information processing and computing.
The following section addresses the
nature of computation from the perspective
of identifying physical (technology-independent)
limits. The next section
will then provide an example of a
technology-dependent limit in the context of
photorefractive volume holographic
optical interconnections. Conclusions
are drawn in a final section.


The  Nature  of  Computational 
Constraints 

In  order  to  implement  any  computa¬ 
tional  algorithm  on  a  machine  of  given 
architecture  and  component  hardware, 
we  must  first  choose  a  representation  for 
inputting  data  and  executing  computa¬ 
tional  steps.  Such  a  representation 
might,  for  example,  be  digital,  analog, 
or  even  symbolic.  As  we  shall  see,  such 
a  choice  of  representation  has  profound 
implications for the physical limits that
apply  to  subsequent  computation. 


For a given representation, we can
then assess the computational complexity
of the chosen algorithms. We define the
computational complexity of an operation
as the equivalent number of irreducible
binary switching operations
required to perform the same operation
in the most efficient manner possible.
For example, the relatively simple
operation of transferring 1,000 ten-bit
words from the CPU to memory will
require at the very least 10⁴ binary
switching operations (assuming that all
of the data is transferred in parallel), and
most likely considerably more if shift
registers are employed to multiplex the
data transfer. This definition of computational
complexity provides a convenient
means for comparing the overall
efficiency or total minimum energy cost
of a given computation performed by
different algorithms on a machine of
given architecture and technology, or
by machines based on vastly different
architectures and perhaps disparate
technological hardware. It does not,
however, take into account the interconnection
complexity required to
implement communications pathways
at the device, circuit, and subsystem
levels.


Finally, we must end each computation
with a detection of the desired
results, in order to allow the result to be
used (for example, to implement a
desired action). This separation of every
computational process into the implementation
of representation, computational
complexity, and detection functions
allows us to establish sets of
interrelated limits that apply to various
combinations of choices.

For purposes of discussion in this
article, let us examine each of these
functions  from  the  point  of  view  of 
energy  cost.  In  so  doing,  we  seek  to 
identify  physical  limits  on  the  maximum 
computational  throughput  achievable 
per  watt  of  dissipated  power.  It  should 
perhaps  be  noted  here  that  the  "require¬ 
ment"  of  energy  dissipation  is  not  in 
fact  a  fundamental  limit,  and  derives 
instead  from  a  system  design  demand 
to  assert  each  stored  value  as  rapidly  as 
possible  in  order  to  complete  the  entire 
computation  deterministically  and  in 
minimum  time.  Thus  adiabatic  compu¬ 
tational  processes  which  do  not  require 
a  minimum  energy  dissipation**  are  not 
applicable  to  the  present  discussion. 

In the digital (binary) realm, the lowest
possible energy cost of representing
a number requiring a given number of
bits is equal to the number of bits times
the minimum energy to store a single
bit. For semiconductor electronic
switches, the minimum energy in turn
is equal to a few tens of k_BT per electron* ’
times the number of electrons
required to guarantee detection with a
given bit error rate (BER) [a quantum
rather than a thermal limit]. For a bit
error rate of 10⁻⁹ or so, about ten electrons
are required (assuming an "ideal"
detector [or following switch] that can
unambiguously differentiate between
the presence and absence of a single
electron). Therefore each switching
event (or representation of each bit)
requires the dissipation of approximately
200 k_BT. This places an immediate
upper boundary on the maximum
computational throughput of such an
electronic digital (binary) computing
engine of approximately 10¹⁸ transitions
(irreducible binary switching operations)
per second per watt at room temperature.
Note that this bound does not
include the assessment of any cost for
internal communication of information,
as would be required to execute an
actual computation.
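
The throughput bound quoted above follows directly from dividing one joule by the switching energy. The short sketch below reproduces that arithmetic, assuming room temperature means 300 K and taking "a few tens of k_BT" as 20 k_BT per electron so that ten electrons give the 200 k_BT figure in the text; both choices, and the function names, are illustrative assumptions.

```python
BOLTZMANN = 1.380649e-23  # J/K

def switching_energy(temperature=300.0, kt_per_electron=20.0, electrons=10):
    """Energy per binary switching event in joules (200 k_B*T with the defaults)."""
    return kt_per_electron * electrons * BOLTZMANN * temperature

def max_transitions_per_second_per_watt(temperature=300.0):
    """Upper bound on switching events per second for one watt of dissipation."""
    return 1.0 / switching_energy(temperature)

energy_per_switch = switching_energy()
bound = max_transitions_per_second_per_watt()

print(f"Energy per switching event: {energy_per_switch:.2e} J")
print(f"Throughput bound: {bound:.1e} transitions per second per watt")
```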

To place this number in proper perspective,
we need only look at the
switching energies representative of
current semiconductor technology. For
electronic devices, we might examine a
typical CMOS capacitor with a 100 µm²
cross-sectional area and a 1,000 Å oxide
thickness, for which the charge/discharge
cycle consumes CV² worth of
energy. If we operate at the minimum
switching voltage of a few tens of
k_BT/q,*' the switching energy is about
10⁶ k_BT at room temperature, about four
orders of magnitude above the fundamental
limit. Current digital logic circuits
operate at roughly 2.5 × 10⁻¹² J/transition,
which is about 10⁹ k_BT.' This
is also roughly the minimum energy
required to operate even a local interconnection
at GHz rates,’* an unavoidable
cost of communication between
devices at the circuit level. At the system
level, a vast amount of additional
overhead comes into play. For example,
the DEC VAX 11-750 dissipates approximately
3 kW running fully loaded at its
maximum throughput rate of 750,000
instructions/second (200,000 operations/second).
Thus the energy required
to perform a single instruction is about
3 mJ or 10¹⁸ k_BT, which corresponds to a
throughput rate of 250 instructions/second/watt.
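
The CMOS capacitor estimate above can be checked with a parallel-plate calculation. The sketch below assumes a silicon dioxide relative permittivity of 3.9, a temperature of 300 K, and a switching voltage of 20 k_BT/q as one reading of "a few tens"; none of these three values is stated in the text, and the function names are hypothetical.

```python
BOLTZMANN = 1.380649e-23         # J/K
ELEMENTARY_CHARGE = 1.602176634e-19  # C
VACUUM_PERMITTIVITY = 8.8541878128e-12  # F/m

def capacitor_switching_energy_in_kt(
    area_m2=100e-12,          # 100 um^2, from the text
    oxide_thickness_m=1e-7,   # 1,000 angstroms, from the text
    relative_permittivity=3.9,  # ASSUMED value for SiO2
    volts_in_kt_over_q=20.0,    # ASSUMED reading of "a few tens of k_B*T/q"
    temperature=300.0,          # ASSUMED room temperature in kelvin
):
    """Return the C*V^2 charge/discharge energy expressed in units of k_B*T."""
    kt = BOLTZMANN * temperature
    capacitance = VACUUM_PERMITTIVITY * relative_permittivity * area_m2 / oxide_thickness_m
    voltage = volts_in_kt_over_q * kt / ELEMENTARY_CHARGE
    return capacitance * voltage ** 2 / kt

energy_in_kt = capacitor_switching_energy_in_kt()
print(f"C*V^2 switching energy: {energy_in_kt:.1e} k_B*T")
print(f"Ratio to the 200 k_B*T limit: {energy_in_kt / 200.0:.1e}")
```

With these assumptions the result is a few times 10⁶ k_BT, roughly four orders of magnitude above the 200 k_BT limit, in line with the estimate in the text.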

By comparison, analog representations
(as used extensively, for example,
in optical processors) require far more
energy than the binary equivalent. This
is due to the necessity of utilizing a
much higher particle count (electrons or
photons) in order to minimize the
effects of quantum statistical fluctuations
on the BER. For example, if we
wish an analog representation of the
number 1,000, then we require a
dynamic range of at least 1,000:1. For
incoherent illumination, quantum fluctuations
in the emission/detection process
produce a photon number distribution
with a relative standard deviation
of σ = √N/N. The equivalent of a 10⁻⁹
BER for the digital case corresponds to
roughly 12 standard deviations. Therefore,
the number of photons required
must be greater than 1.5 × 10⁸ from statistical
considerations alone. For a GaAs
semiconductor laser characterized by a
photon energy of ~1.5 eV, this corresponds
to about 10¹⁰ k_BT. To represent
1,000 optically in binary requires
approximately 14 bits (10 bits for the
number plus 4 bits of overhead) at 15 eV
each (10 photons at 1.5 eV each, assuming
direct detection and an ideal detector),
or about 10⁴ k_BT.
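
The photon-count argument above can be written out as a short calculation. The sketch assumes that the 12-standard-deviation criterion is applied to one part in the dynamic range, so that the relative fluctuation 1/√N must not exceed 1/(12 × 1,000), and that room temperature is 300 K. The function names are hypothetical.

```python
BOLTZMANN_EV = 8.617333262e-5  # eV/K

def analog_photon_count(dynamic_range, n_sigma=12.0):
    """Photons needed so that n_sigma relative fluctuations fit within 1/dynamic_range."""
    return (n_sigma * dynamic_range) ** 2

def energy_in_kt(photons, photon_energy_ev=1.5, temperature=300.0):
    """Total photon energy expressed in units of k_B*T (300 K is an ASSUMED value)."""
    return photons * photon_energy_ev / (BOLTZMANN_EV * temperature)

analog_photons = analog_photon_count(1000)
analog_cost = energy_in_kt(analog_photons)
binary_cost = energy_in_kt(14 * 10)  # 14 bits at 10 photons per bit, from the text

print(f"Analog: {analog_photons:.2e} photons, {analog_cost:.1e} k_B*T")
print(f"Binary: {14 * 10} photons, {binary_cost:.1e} k_B*T")
```

The analog count comes out at about 1.4 × 10⁸ photons, consistent with the rounded value of 1.5 × 10⁸ given in the text, and the two energy costs differ by roughly six orders of magnitude.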

Given  the  remarkably  higher  cost  of 
analog  representation  as  compared  with 
digital,  any  competitive  advantages  for 
analog  systems  from  the  perspective  of 
an energy dissipation metric due to
physical  limits  must  come  from 
enhanced  computational  complexity. 
The  various  tradeoffs  involved  can  per¬ 
haps  best  be  described  in  terms  of  an 
example.  Consider,  then,  a  highly  inter¬ 
connected,  nonlocal  problem  such  as 
the  Fourier  transformation  of  a  two- 
dimensional  function.  Assume  that  the 
function  is  sampled  on  a  1,000  x  1,000 
element  array  at  10  bits. 

If we operate in the binary regime, we
can utilize a highly efficient discrete
Fourier transform (DFT) routine such as
the Cooley-Tukey algorithm. The number
of complex operations (multiplies
and adds) scales as 1.5 N² log₂ N² for an
N × N input. If we assume an electronic
machine implementation that requires
20 switching events/bit/complex operation,
then approximately 10¹⁰ switching
events are required to perform the computation,
or 10 nJ for operation near the
physical limits described above.
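
The operation count for the digital transform can be reproduced as follows. The sketch uses the scaling quoted above together with the 10-bit word length and 20 switching events per bit per complex operation given in the text; the 300 K temperature and the function names are assumptions made for illustration.

```python
import math

BOLTZMANN = 1.380649e-23  # J/K

def dft_complex_operations(n):
    """Complex multiplies and adds for an n x n transform: 1.5 * n^2 * log2(n^2)."""
    return 1.5 * n ** 2 * math.log2(n ** 2)

def dft_switching_events(n, bits=10, events_per_bit_per_op=20):
    """Irreducible binary switching events for the whole transform."""
    return dft_complex_operations(n) * bits * events_per_bit_per_op

def energy_at_physical_limit(events, temperature=300.0, kt_per_event=200.0):
    """Energy in joules if every switching event costs 200 k_B*T (300 K ASSUMED)."""
    return events * kt_per_event * BOLTZMANN * temperature

operations = dft_complex_operations(1000)
events = dft_switching_events(1000)
limit_energy = energy_at_physical_limit(events)

print(f"Complex operations: {operations:.2e}")
print(f"Switching events:   {events:.2e}")
print(f"Energy at the physical limit: {limit_energy * 1e9:.1f} nJ")
```

The computed values, about 6 × 10⁹ switching events and 5 nJ, agree with the rounded order-of-magnitude figures of 10¹⁰ events and 10 nJ used in the text.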

If, on the other hand, the input data
field (two-dimensional function) is represented
in analog form by means of a
spatial light modulator, illumination by
a coherent wavefront will produce the
required transform in the back focal
plane of a (following) lens. Here we see
illustrated the notion of a computation
as the transformation of information. The
energy cost of this particular operation
accrues only to the initial representation
of the function (essentially, the optical
or electronic addressing of the spatial
light modulator) and to the subsequent
detection of the final result. This is then
equivalent to 2 × 10⁶ individual pixel
detection operations for the optically
addressed case, or 70 µJ at the physical
limits for 1.5 eV photons, still about four
orders of magnitude larger than the
binary equivalent. The computational
complexity implemented by the analog
optical processor is thus seen to partially
offset the large initial difference in
representation energy costs. For other
classes of problems with even higher
inherent computational complexity, the
physical boundary on energy cost may
thus favor an analog approach.
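
The 70 µJ figure can be checked by multiplying the number of pixel operations by the per-pixel photon requirement. The sketch below assumes each pixel operation needs (12 × 1,000)² photons, following the 12-standard-deviation and 1,000:1 dynamic range argument made earlier in the article, and compares the result with the 10 nJ digital figure quoted in the text. The function name is hypothetical.

```python
ELECTRON_VOLT = 1.602176634e-19  # J

def analog_transform_energy(
    pixel_operations=2e6,       # representation plus detection, from the text
    photons_per_pixel=(12 * 1000) ** 2,  # ASSUMED: 12 sigma at 1,000:1 dynamic range
    photon_energy_ev=1.5,       # from the text
):
    """Energy in joules for the analog optical Fourier transform at the physical limit."""
    return pixel_operations * photons_per_pixel * photon_energy_ev * ELECTRON_VOLT

DIGITAL_LIMIT_J = 10e-9  # the 10 nJ digital figure quoted in the text

analog_energy = analog_transform_energy()
ratio = analog_energy / DIGITAL_LIMIT_J

print(f"Analog optical transform: {analog_energy * 1e6:.0f} uJ")
print(f"Ratio to the digital limit: {ratio:.1e}")
```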

The situation looks quite a bit different
if we examine current technological
(rather than more fundamental) constraints.
For current digital circuits that
switch at around 2.5 pJ/bit, the DFT just
described dissipates 25 mJ for the
required number of switching operations
alone. Current digital systems
such as the VAX 11-750 consume about
10⁻⁴ J/bit, or 1 MJ for the DFT. With
regard to optical systems, spatial light
modulators are available with input sensitivities
of about 200 pJ/pixel at high
signal-to-noise ratio, and CCD detector
arrays dissipate approximately 1 mJ/Mpixel
on readout. Hence the analog
optical Fourier transform can be performed
for an energy cost of about 1.1
mJ. This apparent capability of analog
optical systems to generate significantly
enhanced computational throughput
per unit input power is thus due largely
to the fact that currently available analog
optical components operate far
closer to the relevant thermal and quantum
limits than current digital electronic
(VLSI) components. It should be noted
that such a comparison begs the question
of overall computational accuracy,
which clearly favors the digital implementations
due to nonlinearities and
nonuniformities in currently available
analog optical components.

The third principal component of a
computational process, that of detection
of the results (whether electronic or
optical), is subject to the same thermal
and quantum statistical limitations as
the process of representation. This
statement follows directly from the
assumption inherent in assessing the
minimum energy required for representation:
that each number must be
capable of detection at a given BER,
assuming an ideal detector. Technological
constraints can then be added to the
fundamental limit problem as before to
assess realistic current capabilities. For
example, available optical detectors
typically require a mean photon number
of 1,000 for a 10⁻⁹ BER in the direct
detection mode, rather than the value of
10 assumed above.

Volume  Holographic  Optical 
Interconnections 

As pointed out in the Introduction
(and discussed in detail in Ref. 1), the
ability to provide complex, multidimensional,
programmable, weighted interconnections
by means of volume holograms
is an attractive feature of optical
processing and computing systems. In
addition, this ability, perhaps more so
than any other, contributes significant
computational complexity to optical
architectures. To this end, it is of considerable
importance to assess the fundamental
limitations that apply to such
volume holographic optical interconnections.

Although many types of photosensitive
media can be utilized for the recording
and readout of volume holograms,
the dynamic reprogrammability offered
by photorefractive crystals such as
lithium niobate, barium titanate, bismuth
silicon oxide, and gallium
arsenide has made such materials the
objects of intensive study. In these
materials, the interference between two
coherent optical wavefronts (the signal
and reference beams) generates a space-variant
photoexcitation distribution that
produces in turn a related space charge
redistribution and associated electric
field pattern by carrier drift and diffusion.
Many interesting limits apply to
such a process, including the maximum
number of independent interconnections
that can be sequentially or simultaneously
recorded at a given value of
allowable crosstalk, the highest achievable
diffraction efficiency of each independent
interconnection at the maximum
interconnection density, the
absolute minimum number of photoevents
per unit volume required to record
an interconnection of given analog
weight within the quantum fluctuation
limits for statistical accuracy of recording
and reconstruction (readout), and
the maximum asymmetry possible
between the recording and erasure processes.
An additional limit of considerable
importance is that of the photorefractive
sensitivity,'*’‘* or the refractive
index modulation obtained in recording
a uniform grating of fixed spatial frequency
per unit (incident or absorbed)
energy density, as this parameter places
an upper bound on the maximum rate
of interconnection reprogrammings that
can be accomplished per unit average
power. In what follows, the fundamental
physical limitations on the photorefractive
sensitivity are examined in a bit
more detail.

A  number  of  factors  contribute  to  the 
photorefractive  sensitivities  characteris¬ 
tic  of  photoconductive,  electrooptic 
materials.  One  such  factor  is  the  photo¬ 
generation  quantum  efficiency,  which 
represents  the  number  of  photogener¬ 
ated  mobile  charge  carriers  per  photon 
absorbed  from  the  recording  beam(s).  A 
second  factor  is  the  charge  transport 
efficiency,  which  is  a  measure  of  the 
degree  to  which  the  average  photogen¬ 
erated  mobile  charge  carrier  contributes 
to  the  forming  space  charge  grating 
after  separation  from  its  original  site  by 
means of drift and/or diffusion and subsequent
trapping. The magnitude of the
space charge field generated by a given
space charge grating is inversely proportional
to the dielectric permittivity ε
of the photorefractive material, which
thus contributes a third factor to the
grating recording sensitivity. A fourth
factor describes the perturbation of the
local index ellipsoid (dielectric tensor at
optical frequencies) that results from a
given space charge field through the
electrooptic (Pockels or Kerr) effect.

In  addition,  several  other  physical 
quantities  factor  into  an  evaluation  of 
the  photorefractive  sensitivity,  includ¬ 
ing  the  wavelength  of  the  recording  illu¬ 
mination  (to  convert  the  number  of 
absorbed  photons  into  an  equivalent 
energy), the absorption coefficients of
the material at the wavelengths of both
the recording and readout beams (to
correct for the fractional absorbance of
the  recording  beams  and  the  fractional 
transmittance  of  the  readout  beam),  and 
the  magnitude  of  the  applied  voltage 
(which  significantly  alters  the  sensitivity 
characteristics  for  certain  materials  by 
changing  the  nature  of  the  dominant 
charge  transport  mechanism  from  the 
diffusion  regime  to  the  drift  regime). 

In order to provide a quantitative
metric that does in fact have a fundamental
physical limitation, we define
the grating recording efficiency” of a photorefractive
recording model or configuration
as the magnitude of the space
charge field produced by a fixed number
of photogenerated mobile charge
carriers at a given spatial frequency,
normalized by the maximum quantum
limited space charge field that can be
produced by the same number of photogenerated
carriers. This metric thus
effectively combines the notions of a
photogeneration efficiency and a charge
transport efficiency, and provides an
estimate of the fundamental quantum
efficiency of the photorefractive grating
recording process. For simplicity, we
confine our attention here to a photorefractive
model characterized by a single
mobile charge species, and a single type
of donor site with associated un-ionized
donor and ionized donor (trap) states.

Let  us  compare  the  grating  recording 
efficiencies  of  four  idealized  grating 
recording  models,  generated  by 
combining  two  types  of  photogenera¬ 
tion  profiles  (a  comb  function  and  a 
sinusoid)  with  two  types  of  charge 
transport  processes  (a  translation  of 
each  photogenerated  charge  carrier  by 
exactly  half  of  a  grating  wavelength; 
and  uniform  redistribution,  or  random¬ 
ization,  of  all  photogenerated  charge). 
The  resulting  four  combinations  are 
shown  schematically  in  Figure  2. 

The maximum quantum limited space
charge field that can result from a fixed
number of photogenerated mobile
charges is given by the bipolar comb
distribution, which generates a square
wave electric field profile from alternating
positive and negative sheets of
charge spaced by half of a grating wavelength.
It is in fact the first harmonic
component of this electric field profile
that we are interested in, as we assume
a volume grating operated deep within
the Bragg diffraction regime. The bipolar
comb distribution can thus be
assigned a grating recording efficiency
of unity.

By comparison, the monopolar comb,
transport-efficient sinusoid, and baseline
sinusoid cases yield grating recording
efficiencies of 1/2, 1/2, and 1/4,
respectively. These results are summarized
in Figure 3, which shows the evolution
of the first harmonic component
of the space charge field for all four
cases as a function of grating recording
time.
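
The efficiencies of 1, 1/2, 1/2, and 1/4 can be reproduced with a small one-dimensional model. The sketch below places one unit of photogenerated charge in a single grating period, leaves the positive (ionized donor) charge where it was generated, moves the negative charge according to the chosen transport rule, and compares the first harmonic of the net charge. Because the grating wavevector is the same in all four cases, the ratio of charge harmonics equals the ratio of field harmonics. The discretization and the function names are assumptions made for illustration, and saturation from a finite trap density is ignored.

```python
import cmath
import math

SAMPLES = 360  # points per grating period (even, so a half-period shift is exact)

def comb():
    """All photogeneration concentrated in one sheet per period."""
    profile = [0.0] * SAMPLES
    profile[0] = 1.0
    return profile

def sinusoid():
    """Photogeneration proportional to 1 + cos(Kx), normalized to unit total charge."""
    return [(1.0 + math.cos(2.0 * math.pi * i / SAMPLES)) / SAMPLES for i in range(SAMPLES)]

def half_period_shift(profile):
    """Every carrier translated by exactly half a grating wavelength."""
    return [profile[(i - SAMPLES // 2) % SAMPLES] for i in range(SAMPLES)]

def uniform_redistribution(profile):
    """Carriers randomized uniformly over the period."""
    return [sum(profile) / SAMPLES] * SAMPLES

def first_harmonic(generation, transport):
    """Magnitude of the first spatial harmonic of the net charge density."""
    trapped = transport(generation)
    net = [donor - electron for donor, electron in zip(generation, trapped)]
    return abs(sum(rho * cmath.exp(-2j * math.pi * i / SAMPLES) for i, rho in enumerate(net)))

reference = first_harmonic(comb(), half_period_shift)  # bipolar comb
efficiencies = {
    "bipolar comb": first_harmonic(comb(), half_period_shift) / reference,
    "monopolar comb": first_harmonic(comb(), uniform_redistribution) / reference,
    "transport-efficient sinusoid": first_harmonic(sinusoid(), half_period_shift) / reference,
    "baseline sinusoid": first_harmonic(sinusoid(), uniform_redistribution) / reference,
}

for name, value in efficiencies.items():
    print(f"{name}: efficiency {value:.3f}, relative diffraction efficiency {value ** 2:.4f}")
```

Squaring the baseline sinusoid efficiency gives 1/16, which is the small-modulation diffraction efficiency penalty relative to the bipolar comb.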

The saturation behavior of each case
derives from assuming a fixed finite trap
density prior to grating recording. Since
the resultant diffraction efficiencies of
such gratings scale as the square of the
space charge field (for small modulation),
these results imply that the baseline
sinusoid distribution will exhibit a
diffraction efficiency lower than the
bipolar comb distribution by a factor of
16. This conclusion is of considerable
interest because the baseline sinusoid
case can be shown** to represent an
asymptotic upper bound on the grating
recording efficiency predicted by more
realistic (rather than idealized) photorefractive
grating recording models that
assume a sinusoidal illumination profile.
In fact, several photorefractive
materials have been demonstrated to
exhibit photorefractive sensitivities that
approach the upper bound represented
by the baseline sinusoid case.**

Figure 2. Schematic diagram depicting
the illumination profile and charge
transport assumptions underlying each of
the four idealized photorefractive
recording models discussed in the text.

Figure 3. The evolution of the space
charge field as a function of grating
recording time for each of the four
idealized photorefractive recording
models. The normalization parameter
$E_q = eN_T/(\epsilon\epsilon_0 K_G)$, in which $N_T$ is the
maximum available trap density and
$K_G$ is the grating wavevector.

Nonetheless,  the  photorefractive 
grating  recording  process  is  from  this 


perspective  somewhat  quantum  ineffi¬ 
cient,  in  that  it  does  not  make  the  best 
possible  use  of  either  input  photons  or 
photogenerated  carriers.  In  view  of  the 
analysis  presented  above,  therefore,  it  is 
of  considerable  interest  to  imagine 
novel engineered photorefractive materials
with donor planes spatially separated
from trap planes, illuminated by a
comb-like intensity pattern (which can
be generated, for example, by utilizing
stratified volume holographic optical
elements [SVHOEs]®). Such materials
could  find  applications  in  devices  based 
on  generation  of  a  carrier  grating  of 
given  spatial  frequency,  such  as  the 
Photorefractive  Incoherent-to-Coherent 
Optical  Converter  (PICOC).^* 

Furthermore, it should be noted that
the limits derived for interconnections
based on photorefractive materials pertain
to a particular physical mechanism
involving photoexcitation, charge transport,
and electrooptically induced index
changes. The importance of multiplexed
interconnections to overall schemes for
optical processing and computing suggests
continued emphasis on the search
for alternative physical effects that can
provide either enhanced sensitivity or
more conveniently implementable
desirable features such as selective erasure.

Conclusions 

An  examination  of  both  the  fun¬ 
damental  physical  and  current  techno¬ 
logical  limitations  to  computational  per¬ 
formance  shows  that  in  terms  of 
throughput  per  unit  power,  digital 
(binary)  representations  (whether  elec¬ 
tronic  or  optical)  presently  operate 
many  orders  of  magnitude  away  from 
the  relevant  boundaries.  Analog  repre¬ 
sentations  and  detections  are  inherently 
much  more  power  consumptive  than 
digital  representations  and  detections, 
and processing schemes based on analog
algorithms must demonstrate considerably
enhanced computational complexity
prior to each intermediate
detection  plane  in  order  to  be  energy 
competitive  at  the  appropriate  thermal 
and  quantum  limits. 

Hybrid  architectures  and  algorithms 
(such  as  those  employed  by  numerous 
neural  network  models)  that  effectively 
combine  analog  representations  as 
weights  in  highly  multiplexed  parallel 
interconnections  with  binary  represen¬ 
tations  in  the  form  of  threshold  arrays 
(decision  planes)  may  ultimately  prove 
to  be  a  nearly  optimum  compromise. 
This  is  particularly  applicable  to  optical 
processing  and  computing  architec¬ 


tures,  for  which  available  analog  com¬ 
ponents  (spatial  light  modulators  and 
photorefractive  volume  holographic 
optical elements) operate much closer to
the  relevant  quantum  statistical  limits 
than  the  corresponding  digital  compo¬ 
nents  (threshold  arrays).  In  such  hybrid 
architectures,  the  threshold  array  per¬ 
forms  the  equivalent  of  a  parallel  level 
restore  function  (saturated  nonlinearity) 
that  is  essential  to  the  proper  conver¬ 
gence  of  many  processing  and  comput¬ 
ing  algorithms. 

Further  study  of  the  fundamental 
physical  limitations  that  apply  to  optical 
processing  and  computing  will  provide 
important  guidance  for  the  continued 
development  of  active  optical  materials, 
primarily  by  differentiating  between 
avenues  of  opportunity  with  large 
potential  gain  that  depend  on  key  mate¬ 
rials  parameters,  and  those  for  which 
current  materials  characteristics  are  suf¬ 
ficient  to  approach  relevant  perfor¬ 
mance  boundaries. 
Binary optics correction of typical lenses.


1) add the power of laser diode arrays (modular laser power),
2) beam steering or wavefront manipulation, and 3)
opto-electronic integration of imager focal planes.

Thus far, work has concentrated on the technology involved
in designing and producing individual elements for
systems. The vast potential of binary optics to spawn new
products and industries will materialize only as industrial
expertise develops, system designers become familiar with
its cost, weight, and design benefits, and manufacturers
put it in production through replication, embossing, and
forging or molding from a single master element.


STRATIFIED  VOLUME  HOLOGRAPHIC 
OPTICAL  ELEMENTS 

R.V. Johnson and A.R. Tanguay, Jr.

Optical Materials and Devices Laboratory, and
Center for Photonic Technology
University of Southern California
Los Angeles, Calif.

Holographic optical elements are of fundamental importance
to a number of applications in optical information
processing and computing, including optical interconnections,
content-addressable memories, and various
linear and nonlinear signal processing configurations.
Numerical analyses of typical volume holographic structures
are critical for characterizing and optimizing such
optical elements. One of the most flexible and broadly
applicable tools for studying the diffraction behavior of volume
holograms is the optical beam propagation method
(BPM). In this method, the distributed optical inhomogeneities
that characterize a typical hologram are approximated
by a discrete sequence of physically and mathematically
simplified elements: infinitesimally thin phase and/or
polarization modulation layers, interleaved with optically
homogeneous layers of finite thickness. By approximating
the distributed grating with a sufficiently large number of
these discrete elements, the resulting numerical model of
the grating can be brought arbitrarily close to that of the
desired distributed bulk grating.

The BPM concept of separating the modulation process
from the diffraction process in turn suggests a new class of
optical devices: the stratified volume holographic optical
element, or SVHOE (see figure). The SVHOE device
structure consists of a sequence of thin photosensitive
holographic recording layers that perform the optical
modulation function, interleaved with optically passive buffer
layers, i.e., layers that impress no modulation on the light
beam but rather allow the diffraction processes necessary
for thick grating response to occur. A grating recorded in
any individual modulation layer would necessarily exhibit
Raman-Nath characteristics because each recording layer
is quite thin. But for a given angular interval, the
incorporation of a sufficiently large number of thin gratings leads
to a structure with an angular alignment sensitivity that is
indistinguishable from that of the bulk grating. Thus, an
SVHOE structure can closely emulate the Bragg response
of a bulk grating, using surprisingly few thin grating layers.
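
The narrowing of the angular response with layer count can be illustrated with a minimal kinematic sketch. It assumes weak (single-scattering) diffraction from identical, equally spaced thin gratings, so that the first-order amplitude is simply a sum of phasors; the function names and the per-layer phase mismatch variable `phase_step` are illustrative assumptions and are not taken from the BPM analysis cited above.

```python
import cmath
import math


def stack_response(n_layers: int, phase_step: float) -> float:
    """Normalized first-order intensity from n_layers identical thin gratings.

    phase_step is the assumed Bragg phase mismatch (radians) accumulated
    between adjacent grating layers; zero corresponds to exact Bragg alignment.
    Weak diffraction is assumed, so the layer contributions add as phasors.
    """
    total = sum(cmath.exp(1j * m * phase_step) for m in range(n_layers))
    return abs(total) ** 2 / n_layers ** 2


def half_width(n_layers: int, steps: int = 20000) -> float:
    """Smallest phase_step in (0, pi] at which the response drops to half its peak.

    Returns pi if the response never drops that far (single layer).
    """
    for i in range(1, steps + 1):
        delta = math.pi * i / steps
        if stack_response(n_layers, delta) <= 0.5:
            return delta
    return math.pi


if __name__ == "__main__":
    # The half-width shrinks as layers are added, mimicking Bragg selectivity.
    for n in (1, 2, 5, 10, 20):
        print(n, round(half_width(n), 4))
```

In this simplified picture a single layer shows no angular selectivity at all (Raman-Nath behavior), while the selectivity sharpens roughly in proportion to the number of layers; a full BPM calculation would be needed for strong gratings.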

In addition, this structure exhibits features not found in
bulk gratings that may prove useful for certain system
applications. For example, illumination of an SVHOE structure
with focused monochromatic light produces uniformly
spaced angular response peaks, which can be transformed
with a lens to produce an array of equally spaced
points. This function is useful for interconnection of optical
cellular logic arrays. For example, illumination with
850 nm light of a two grating SVHOE with a grating
separation of 1 cm and a grating spatial frequency of 350
cycles/mm at F/3.3 will produce in excess of 1000 regularly
spaced diffracted beams. The occurrence of multiple (periodic)
peaks in the angular response profile can be applied
to angular encoding applications, in which a two grating
SVHOE mounted on the part to be tracked can be probed
by a single beam to allow coarse angular measurement by
peak counting, and fine angular measurement by interpolation.
Highly selective wavelength notch filtering can also
be accomplished in SVHOE structures by appropriate
choice of buffer layer thickness and grating spatial
frequency.
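
The quoted beam count can be checked with a rough paraxial estimate. The sketch below assumes that the angular period of the response peaks of a two-grating structure is the grating period divided by the grating separation, that the buffer layer has unit refractive index, and that the illumination fills a cone of full angle 2·atan(1/(2F)); in this approximation the wavelength does not enter. The function names are hypothetical.

```python
import math


def peak_spacing_rad(spatial_freq_per_mm: float, separation_mm: float) -> float:
    """Paraxial angular period of the response peaks (grating period / separation)."""
    period_mm = 1.0 / spatial_freq_per_mm
    return period_mm / separation_mm


def cone_full_angle_rad(f_number: float) -> float:
    """Full angle of the focused illumination cone for a given F-number."""
    return 2.0 * math.atan(1.0 / (2.0 * f_number))


def peak_count(spatial_freq_per_mm: float, separation_mm: float, f_number: float) -> int:
    """Approximate number of response peaks that fit inside the illumination cone."""
    spacing = peak_spacing_rad(spatial_freq_per_mm, separation_mm)
    return int(cone_full_angle_rad(f_number) / spacing)


if __name__ == "__main__":
    # Values from the text: 350 cycles/mm, 1 cm separation, F/3.3.
    print(peak_count(350.0, 10.0, 3.3))
```

With the values given in the text, this estimate yields slightly more than 1000 peaks, consistent with the figure quoted above.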

Further, for real-time material implementations of
SVHOE structures in which the photoresponse for grating
formation can be either optically or electrically switched
on a layer-by-layer basis, a structure with a given number
of layers can interconnect either every element or every
pth element with the source, in a programmable manner.
For example, such layer-by-layer programmability may
The SVHOE structure, showing the recording of a grating
with wave vector $K_G$ by means of two coherent,
collimated recording beams.


prove feasible by either optical bleaching or electrical tuning
of the exciton resonance in a multiple quantum well
structure, or by modulation of a doping superlattice.
SVHOE structures can also be conceived that exhibit
entirely different diffraction behavior in response to grating
formation by distinct writing wavelengths, by actively
altering the layer photosensitivity spectrum on a
layer-by-layer basis.

These novel features of SVHOEs collectively allow for
numerous additional degrees of freedom in the formulation
of both passive and active holographic elements.
Optical data at a bit rate of 16 Gb/s were recently
transmitted over optical fiber in an experiment at
AT&T Bell Laboratories. This is the highest transmission
bit-rate reported for any single-channel lightwave system,
an achievement made possible using optical technology
to time-division multiplex and demultiplex the data.

Commercial lightwave transmission systems use electronic
time-division multiplexing (TDM) to combine a
number of electronic data channels into a single higher
capacity channel for transmission. In essence, time-division
multiplexing is a digital technique in which data bits
are interleaved in time. As bit-rates move into the
multigigabit per second range, the speed of available electronics
is being pushed to the limit and the electronic multiplexing
circuitry causes a speed bottleneck. The optical time-division
multiplexing techniques used in the Bell Laboratories
experiments retain the digital format of the signal, but
overcome the speed bottleneck. This is achieved by
capitalizing on the inherent wide-band capabilities of optical
components such as pulsed semiconductor lasers and lithium
niobate (LiNbO3) directional coupler switches.

In the 16 Gb/s optical time-division multiplexed system,
four optical data streams at 4 Gb/s are time-multiplexed
directly in the optical domain to give a 16 Gb/s data
stream for transmission. At the receiver end of the system,
the 16 Gb/s data stream is demultiplexed optically to four
lower bit-rate optical streams before detection and conversion
to the electrical domain.

The 4 Gb/s input data streams originate from short optical
pulses generated by four mode-locked semiconductor
lasers. Data are encoded on the pulses using LiNbO3
waveguide optical modulators. Optical multiplexing is
carried out by simply delaying the optical pulse streams
and combining them in a passive fiber directional coupler
array.
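
The bit-interleaving arithmetic described above can be sketched in a few lines. This is a logical model only, under the assumption that each tributary is delayed by an integer number of 16 Gb/s bit slots before combination; the names `slot_delays_ps`, `multiplex`, and `demultiplex` are illustrative and do not describe the actual optical hardware.

```python
from typing import List, Sequence

LINE_RATE_GBPS = 16.0  # aggregate rate quoted in the text
CHANNELS = 4           # four 4 Gb/s tributaries


def slot_delays_ps(channels: int = CHANNELS,
                   line_rate_gbps: float = LINE_RATE_GBPS) -> List[float]:
    """Relative delay (ps) applied to each tributary so its pulses fall in its own slot."""
    bit_slot_ps = 1000.0 / line_rate_gbps  # one bit period at the line rate
    return [k * bit_slot_ps for k in range(channels)]


def multiplex(streams: Sequence[Sequence[int]]) -> List[int]:
    """Interleave equal-length tributary bit streams into one line-rate stream."""
    if len({len(s) for s in streams}) != 1:
        raise ValueError("all tributaries must have the same length")
    return [bit for frame in zip(*streams) for bit in frame]


def demultiplex(line: Sequence[int], channels: int = CHANNELS) -> List[List[int]]:
    """Recover the tributaries by taking every channels-th bit of the line stream."""
    return [list(line[k::channels]) for k in range(channels)]


if __name__ == "__main__":
    tributaries = [[1, 0, 1], [0, 0, 1], [1, 1, 0], [0, 1, 1]]
    line = multiplex(tributaries)
    print(slot_delays_ps())
    print(line)
    print(demultiplex(line) == tributaries)
```

At 16 Gb/s the bit slot is 62.5 ps, so the four tributaries would be offset by 0, 62.5, 125, and 187.5 ps in this idealized picture.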

The demultiplexer, the most important component in
the system, separates the pulses on the incoming 16 Gb/s
stream into four streams each at 4 Gb/s. A block diagram
of the demultiplexer and its associated electronics is
7.1 Overview

High performance spatial light modulators (SLMs) are essential in many optical
information processing and computing applications for converting incoherent
images to coherent replicas suitable for subsequent processing [7.1,2]. A typical
spatial light modulator consists of a photosensitive element to capture the
incoherent light image and an optical modulator element to impress the incoherent
input image content onto a coherent readout beam. A particularly important
class of spatial light modulators employs photorefractive crystals that combine
both photosensitive and electrooptic modulation functions within the same
medium. Examples of electrooptic spatial light modulators that utilize
photorefractive crystals include the Pockels Readout Optical Modulator (PROM) [7.3]
and the PRIZ (a Soviet acronym for a crystallographically modified PROM)
[7.4].

During operation of the Pockels Readout Optical Modulator, the input
image-bearing beam creates photoinduced carriers which are longitudinally
separated by an applied bias electric field. This charge separation produces a
space-variant division of the applied field across the active electrooptic crystal
and one or more dielectric blocking layers, as shown in the upper left quadrant
of Fig. 7.1. The local birefringence of the medium depends on the longitudinal
component of the local electric field, and hence can be sensed by a polarized
readout beam observed through an exit analyzer. In the PRIZ, the same
charge generation and separation process is utilized, with the difference that
the crystallographic orientation is chosen to emphasize readout sensitivity to
the transverse components of the induced electric field distribution, as shown
in the upper right quadrant of Fig. 7.1.

The PROM and the PRIZ are typically limited in spatial frequency response
to the order of 10 cycles/mm at optimum optical exposure [7.5], and hence
are limited to relatively modest bandwidth optical processing and computing
applications. The physical configurations of the PROM and PRIZ devices
do not lend themselves readily to exploitation of the remarkably high spatial
bandwidths available in holographic configurations, in which the input image
is encoded on a spatial carrier (as shown in the lower left quadrant of Fig. 7.1
and discussed extensively elsewhere in this book). Even when utilizing the
same electrooptic crystal as in the PROM and PRIZ (typically bismuth silicon
oxide), holographic recording configurations employing transverse applied electric
fields and no blocking layers have been shown to exhibit spatial frequency
bandwidths in excess of 2000 cycles/mm. However, such purely holographic
recording requires input of image-dependent information on one of two coherent
input beams, and as such cannot be directly utilized for performing the
incoherent-to-coherent conversion function.

Fig. 7.1. Schematic diagram of the principal components of the applied voltage, charge
transport, sensed electric field components, and input light wave vectors for four types of
spatial light modulators that utilize single crystal bismuth silicon oxide (Bi12SiO20). The
acronym VHOE stands for volume holographic optical element; likewise PROM stands for
the Pockels Readout Optical Modulator, PRIZ is a Soviet acronym for a crystallographically
modified PROM, and PICOC stands for the photorefractive incoherent-to-coherent optical
converter, the subject of this chapter

A fourth distinct type of photorefractive spatial light modulator has been
independently proposed by Kamshilin and Petrov [7.6] and by the present
authors [7.7,8] which combines the incoherent-to-coherent conversion function
with an essentially holographic recording process, and thereby exhibits several
advantages of each. As in the holographic recording case, a transverse applied
electric field is used in conjunction with two uniform coherent writing beams
to produce a volume grating that is then selectively modified by a third beam
encoded with the information content to be stored or converted. This
configuration is shown schematically in the lower right quadrant of Fig. 7.1. The
Photorefractive Incoherent-to-Coherent Optical Converter (PICOC) [7.7,8] device
is capable of recyclable real time operation, is characterized by an enhanced
spatial frequency response, and is even simpler to construct than the PROM
or PRIZ. In addition, the PICOC device configuration allows its use in many
quasi-holographic techniques (such as optical phase conjugation), which in turn
lead to potentially novel optical information processing and computing
architectures [7.9-11].


The photorefractive incoherent-to-coherent optical conversion process is
described  in  Sect.  7.2,  as  are  alternative  sequencing  schemes  and  optical  imple¬ 
mentations.  In  Sect.  7.3,  the  limiting  assumptions  needed  to  derive  a  tractable 
analytical  model  of  PICOC  performance  are  specified,  in  preparation  for  de¬ 
tailed  discussion  of  the  recording  stage  in  Sect.  7.4,  and  of  the  readout  stage  in 
Sect.  7.5.  Conclusions  and  future  research  directions  are  offered  in  Sect.  7.6. 


7.2  Physical  Principles  and  Modes  of  Operation 

The  photorefractive  incoherent-to-coherent  optical  conversion  (PICOC)  process 
is  perhaps  best  understood  as  an  extension  of  the  more  familiar  holographic 
recording  process  in  a  photorefractive  medium.  The  physical  principles  gov¬ 
erning  such  recording  are  briefly  reviewed  in  this  section,  and  in  much  greater 
detail  in  Sect.  7.4.  The  extension  of  this  recording  process  to  include  PICOC 
allows  for  at  least  three  different  temporal  modes  for  sequencing  the  coherent 
grating  with  respect  to  the  incoherent  image.  These  modes  are  identified  and 
compared  in  this  section.  In  addition,  two  alternative  optical  architectures  are 
defined,  and  converted  images  generated  by  one  representative  configuration 
are  presented. 

The high sensitivity of photoconductive and electrooptic crystals such as
bismuth silicon oxide (Bi12SiO20, or BSO) in the visible portion of the
spectrum has allowed the simultaneous recording and reading of volume holograms
to be achieved with time constants amenable to real-time operation [7.12]. The
holographic recording process in photorefractive materials involves
photoexcitation, charge transport, and trapping mechanisms [7.13]. When two coherent
beams are allowed to interfere within the volume of such a crystal, free carriers
are nonuniformly generated by absorption and are redistributed by diffusion
and/or drift under the influence of an externally applied electric field.
Subsequent trapping of these charges at relatively immobile trapping sites generates a
stored space-charge field, which in turn modulates the refractive index through
the linear electrooptic (Pockels) effect and thus records a volume phase
hologram. If both coherent writing beams are plane waves, the induced hologram
consists of a uniform grating.
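
The chain from space-charge field to index modulation to diffraction efficiency can be sketched numerically. The following is a rough scalar estimate only: it assumes the standard Pockels relation Δn = ½ n³ r E and Kogelnik's expression for a lossless, unslanted transmission phase grating, and it neglects the optical activity and absorption of BSO. The refractive index (2.6), effective electrooptic coefficient (5 pm/V), and space-charge field amplitude (4 kV/cm) are assumed illustrative values; the thickness, wavelength, and spatial frequency are borrowed from the experimental example later in this section.

```python
import math


def index_modulation(n0: float, r_eff_m_per_v: float, e_sc_v_per_m: float) -> float:
    """Index modulation amplitude from the linear electrooptic (Pockels) effect."""
    return 0.5 * n0 ** 3 * r_eff_m_per_v * e_sc_v_per_m


def bragg_angle_in_crystal(wavelength_m: float, n0: float, period_m: float) -> float:
    """Internal Bragg angle (radians) for an unslanted grating of the given period."""
    return math.asin(wavelength_m / (2.0 * n0 * period_m))


def diffraction_efficiency(delta_n: float, thickness_m: float,
                           wavelength_m: float, theta_rad: float) -> float:
    """Kogelnik efficiency of a lossless transmission phase grating at Bragg incidence."""
    nu = math.pi * delta_n * thickness_m / (wavelength_m * math.cos(theta_rad))
    return math.sin(nu) ** 2


if __name__ == "__main__":
    n0 = 2.6                # assumed refractive index of BSO near 515 nm
    r_eff = 5.0e-12         # assumed effective electrooptic coefficient, m/V
    e_sc = 4.0e5            # assumed space-charge field amplitude, V/m (4 kV/cm)
    thickness = 1.3e-3      # crystal thickness, m
    wavelength = 515e-9     # writing and readout wavelength, m
    period = 1.0e-3 / 300   # grating period for 300 cycles/mm, m

    dn = index_modulation(n0, r_eff, e_sc)
    theta = bragg_angle_in_crystal(wavelength, n0, period)
    print(dn, diffraction_efficiency(dn, thickness, wavelength, theta))
```

Under these assumptions the index modulation is of order 1e-5 and the diffraction efficiency is of order a few percent; actual values depend on crystal orientation, polarization, and the true space-charge field.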

In the photorefractive incoherent-to-coherent optical conversion (PICOC)
process, an incoherent image is focused in the volume of the photorefractive
material in addition to the coherent grating beams, creating an additional
spatial modulation of the charge distribution stored in the crystal. This spatial
modulation can be transferred onto a coherent readout beam by reconstructing
the holographic grating. The spatial modulation of the coherent reconstructed
beam will then be a negative replica of the input incoherent image, as shown
in Fig. 7.2. It should be noted here that a related image encoding process can
be implemented nonholographically by premultiplication of the image with a
grating [7.14].
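
The contrast reversal can be made plausible with a very simple estimate. Adding a uniform incoherent intensity to a fringe pattern lowers the fringe modulation depth; if, as a small-modulation assumption that is not the full model of this chapter, the diffracted intensity scales as the square of that modulation depth, brighter image regions read out darker. The function names are hypothetical, and the 0.4 and 8.0 mW/cm² values are borrowed from the experimental example later in this section.

```python
def effective_modulation(m: float, coherent_intensity: float,
                         incoherent_intensity: float) -> float:
    """Fringe modulation depth after adding a uniform incoherent background.

    The total intensity is I_c * (1 + m cos Kx) + I_inc, so the modulation
    depth becomes m * I_c / (I_c + I_inc).
    """
    return m * coherent_intensity / (coherent_intensity + incoherent_intensity)


def relative_readout(m: float, coherent_intensity: float,
                     incoherent_intensity: float) -> float:
    """Diffracted intensity relative to the unexposed case.

    Assumes (small-modulation sketch) that efficiency scales as modulation squared.
    """
    return (effective_modulation(m, coherent_intensity, incoherent_intensity) / m) ** 2


if __name__ == "__main__":
    # Coherent grating intensity 0.4 mW/cm^2; image intensity from dark to bright.
    for image_intensity in (0.0, 0.4, 2.0, 8.0):
        print(image_intensity, relative_readout(1.0, 0.4, image_intensity))
```

The readout falls monotonically as the local image intensity rises, which is the negative-replica behavior described above; the real response also involves charge transport and nonlinearities that this sketch ignores.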
Fig. 7.2. Principle of operation of the photorefractive incoherent-to-coherent optical
converter (hereinafter referred to as PICOC)


Fig. 7.3. PICOC in the grating erasure mode (GEM), in which the carrier grating is recorded
before the incoherent image-bearing signal
In PICOC, the holographic grating can be recorded before, during, or
after the crystal is exposed to the incoherent image. Therefore, several distinct
operating modes are possible. These include the grating erasure mode (GEM;
Fig. 7.3), the grating inhibition mode (GIM; Fig. 7.4), and the simultaneous
erasure/writing mode (SEWM; Fig. 7.5).

In the grating erasure mode (GEM), shown schematically in Fig. 7.3, a
uniform grating is first recorded by interfering two coherent writing beams in
the photorefractive crystal. The writing beams are turned off, and this grating is
then selectively erased by incoherent illumination of the crystal with an
image-bearing beam. The incoherent image may be incident either on the same face of
the crystal as the writing beams, or on the opposite face. When the absorption
coefficients at the writing and image-bearing beam wavelengths give rise to
significant depth nonuniformity within the crystal, these two cases will have
distinct wavelength-matching conditions for response optimization [7.8].

In  the  grating  inhibition  mode  (GIM),  shown  schematically  in  Fig.  7.4, 
the  crystal  is  pre-illuminated  with  the  incoherent  image-bearing  beam  prior 
to  grating  formation.  This  serves  to  selectively  decay  (enhance)  the  applied 
transverse  electric  field  in  exposed  (unexposed)  regions  of  the  crystal.  After 
this  pre-exposure,  the  coherent  writing  beams  are  then  allowed  to  interfere 
within  the  crystal,  causing  grating  formation  with  spatially  varying  efficiency 
due  to  significant  differences  in  the  local  effective  applied  field. 

In the simultaneous erasure/writing mode (SEWM), shown
in Fig. 7.5, the incoherent image modulation, the coherent grating formation
process, and the readout function are performed simultaneously. A stable image
is transcribed after the space-charge field has reached steady state.


Fig. 7.4. PICOC in the grating inhibition mode (GIM), in which the carrier grating is
recorded after the incoherent image-bearing signal

Fig. 7.5. PICOC in the simultaneous erasure/writing mode (SEWM), in which the carrier
grating and the incoherent image-bearing signal are recorded simultaneously

A  further  distinction  can  be  made  in  the  operating  modes  between  strictly 
cyclic  exposure/readout,  with  an  upper  limit  on  the  readout  time  interval,  and 
operation  in  which  prolonged  readout  times  are  required.  Cyclic  readout  can  be 
achieved  by  any  one  of  the  three  sequencing  modes  introduced  above,  and  the 
sensitometry  requirement  for  achieving  good  quality  images  can  be  expressed  in 
terms of optical exposure (i.e., optical energy per unit area). When prolonged
readout  is  required,  degradation  of  the  stored  space-charge  profile  can  occur, 
due  either  to  dark  current  or  erasure  induced  by  the  optical  readout  beam.  The 
most  appropriate  sequencing  mode  for  prolonged  readout  is  the  simultaneous 
erasure/writing  mode  (SEWM)  because  it  constantly  regenerates  the  space- 
charge  field  profile.  However,  the  sensitometry  requirement  for  this  latter  mode 
is better expressed in terms of optical power rather than optical exposure,
assuming readout time intervals long compared with the time required to achieve
a  stable  steady  state  readout  image.  In  exchange  for  this  optical  energy  penalty, 
SEWM  offers  a  considerably  simplified  experimental  configuration  with  no  need 
for temporal sequencing, a much greater tolerance for photorefractive crystals
with increased dark conductivity, and readout of essentially unlimited duration.
Readout  light  beams  of  much  shorter  wavelength  and/or  much  higher  intensi¬ 
ties  can  be  accommodated  in  SEWM  without  incurring  unacceptable  erasure 
of  the  charge  pattern.  Because  of  its  experimental  convenience  and  analytical 
simplicity,  SEWM  is  emphasized  in  this  chapter,  with  more  detailed  discussion 
of  GEM  and  GIM  given  later  in  Sect.  7.4.4. 

Optical Implementations. The original implementation of PICOC described by
Kamshilin  and  Petrov  [7.6]  is  a  modification  of  the  nondegenerate  four-wave 
mixing  geometry  to  include  simultaneous  exposure  by  an  incoherent  image¬ 
bearing  beam.  This  configuration  requires  a  readout  wavelength  separate  and 
distinct  from  the  coherent  grating  writing  wavelength,  which  then  allows  the 
readout  wavelength  to  be  selected  for  significantly  reduced  grating  erasure 
rates. Thus, the grating inhibition mode (GIM) and the grating erasure mode
(GEM) are best implemented in this configuration. However, the optical
alignment is more intricate with this architecture than it is with the degenerate
four-wave mixing geometry introduced next, since the Bragg angle of the
readout beam will not be the same as the Bragg angle of the coherent writing beams
(shown in Fig. 7.6).

An alternative optical implementation is a modification of the conventional
degenerate four-wave mixing geometry to include simultaneous exposure by an
incoherent image-bearing beam, as shown in Fig. 7.7. This implementation has
the advantage of extremely easy optical alignment, as the readout beam is
readily Bragg aligned by retroreflecting one of the two coherent grating writing
beams. It has the disadvantage that the readout beam, being at the same
wavelength as the coherent grating writing beams, must erase the grating structure
being probed at rates comparable to the writing process, assuming a readout
light intensity comparable to the writing light intensities. Thus this
implementation can be utilized for the simultaneous erasure/writing mode (SEWM), and
can be adopted for the grating erasure mode (GEM) and the grating inhibition
mode (GIM) only by significantly reducing the probe beam intensity, with a
correspondingly reduced readout signal intensity.

As a specific example of the PICOC process, consider a degenerate four-wave
mixing configuration in which the coherent writing beams and the incoherent
image-bearing beam were made to illuminate the same face of a 1.3 mm
thick crystal of bismuth silicon oxide, obtained from Crystal Technology, Inc.
An electric field of 4 kV/cm was applied along the (110) axis, as shown in
Fig. 7.8. A 300 cycles/mm grating was written by the 515 nm line of an argon
ion laser with the grating wave vector oriented parallel to the applied bias field
to maximize the diffraction efficiency. The image-bearing light source was either
a xenon arc lamp, a tungsten lamp, or the 488 nm line of the argon ion laser.
The average coherent grating intensity was 0.4 mW/cm² and the image-bearing
light intensity was typically 8.0 mW/cm². The coherent grating writing beams
were polarized orthogonal to the applied electric field. A polarizer was inserted
at the output to minimize coherent optical scatter from the crystal [7.15].

Fig. 7.7. Degenerate four-wave mixing architecture to perform the photorefractive
incoherent-to-coherent optical conversion


Fig. 7.8. Bismuth silicon oxide crystal orientation for the PICOC transverse electrooptic
configuration and recording geometry. The volume holographic grating with wave vector $K_G$
is formed by the coherent writing beams $I_1$ and $I_2$, and the incoherent image information is
encoded on beam $I_0$
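
The alignment difference between the degenerate and nondegenerate geometries follows directly from the Bragg condition. The sketch below evaluates the external (in-air) Bragg half-angle, sin θ = λ/(2Λ), for the 300 cycles/mm grating at the 515 nm writing wavelength and, for comparison, at a hypothetical 633 nm readout wavelength that is not taken from the text. It assumes an unslanted grating and symmetric incidence on a crystal face normal to the grating planes.

```python
import math


def external_bragg_half_angle_deg(wavelength_nm: float,
                                  spatial_freq_cycles_per_mm: float) -> float:
    """External Bragg half-angle (degrees) for an unslanted grating, measured in air."""
    period_nm = 1.0e6 / spatial_freq_cycles_per_mm  # grating period in nm
    return math.degrees(math.asin(wavelength_nm / (2.0 * period_nm)))


if __name__ == "__main__":
    writing = external_bragg_half_angle_deg(515.0, 300.0)
    readout = external_bragg_half_angle_deg(633.0, 300.0)  # hypothetical readout line
    print(round(writing, 2), round(readout, 2))
```

In the degenerate geometry the readout beam shares the writing wavelength and therefore the same Bragg angle, which is why retroreflection of a writing beam suffices; a different readout wavelength requires a separately aligned angle roughly one degree larger in this example.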


Sample  converted  images  obtained  from  two  binary  transparencies  (a  spoke 
target  and  an  U.S.  Air  Force  resolution  target)  and  from  two  black-tuid-white 
slides  with  continuous  tone  gray  scale  are  shown  in  Fig.  7.9.  The  original  trans¬ 
parency  and  its  converted  image  have  reversed  contrast,  as  shown  in  this  figure 
and  as  explained  by  Fig.  7.2.  An  approximate  resolution  of  15  line  pairs/mm 
(determined  from  the  resolution  target  image)  was  achieved  without  optimizing 
factors  such  as  the  Bragg  readout  condition.  Such  optimization  results  in  strik¬ 
ing  enhancements  of  the  resolution  to  of  order  50  line  pairs/mm,  as  discussed  in 
Sect.  7.5  below.  Similar  images  of  comparable  quality  have  also  been  recorded 
in  a  bismuth  silicon  oxide  crystal  in  which  the  (001)  axis  is  aligned  parallel  to 
the  coherent  grating  wave  vector  and  to  the  applied  bias  electric  field. 

Having  reviewed  in  broadest  terms  the  physical  principles  and  modes  of 
operation  of  the  PICOC  process,  let  us  now  delineate  the  scope  and  limitations 


Fig. 7.9a-d. Examples of the conversion of binary and gray-level transparencies: (a) spoke
target, (b) U.S. Air Force resolution target, (c) airplane, and (d) an incoherent student


of the analytical model, as given in the next section, in preparation for more
detailed  studies  of  the  recording  process  in  Sect.  7.4  and  the  readout  process  in 
Sect.  7.5. 

7.3  Delineation  of  the  Analytical  Model 

A reasonably accurate and complete model of the photorefractive
incoherent-to-coherent optical conversion process is based upon a set of equations that govern
trap and electron balance, electron transport, and the buildup of a space-charge
field, as detailed in Sect. 7.4.1. This model exhibits both nonlocal response
due to charge transport and striking nonlinearities. No analytical solution has
yet been identified that is broadly applicable to the full range of important
experiments (e.g., experiments involving large modulation depths and
photoinduced variations in the recombination time). Numerical solutions are certainly
feasible, but have not yet been fully explored. To help refine physical insight into
the conversion process, an approximate model capable of analytic solution needs
to be defined, but its interpretation must be tempered with careful attention
to its limitations.

Two such approximate solution models are identified herein. The first
approach, the "constant recombination time approximation," is based upon the
analytical studies of Moharam et al. [7.16], and was presented previously by
the authors [7.8]. This approach is discussed in sufficient detail in Sect. 7.3.1
to enable comparison with the second approach, the "perturbation series
approximation," as introduced in Sect. 7.3.2. These two approaches are compared
and contrasted in this section. The perturbation series approximation is then
used as the basis for continued discussion of PICOC performance
characteristics throughout the remainder of the chapter, and is detailed more fully in
Sect. 7.4.1.

7.3.1 Constant Recombination Time Approximation

One such approximate model of the PICOC process [7.8] evolves from analytic
solutions of single spatial frequency grating recording as derived by Moharam
et al. [7.16]. This model approximates quite well the nonlinearity of the conversion
process, but it is limited in scope to steady state behavior. Hence the
analytical solutions of this model (derived in [7.8]) are suitable only for studying
the simultaneous erasure/writing mode (SEWM) in the steady state regime,
and cannot cope with the grating erasure mode (GEM) and the grating inhibition
mode (GIM) response. A further limitation of this approximation is its
assumption of a constant recombination time (which is equivalent to assuming
an infinite trap-limited saturation field strength), and so this will hereinafter be
referred to as the “constant recombination time” model. A more accurate study
of the photorefractive effect, as defined by a set of photoconductivity equations,
allows for photoinduced variations of the recombination time from its nominal
value. Although these variations may be quite small in amplitude compared
with the nominal value, they nevertheless provide a very important mechanism
that can significantly modify the space-charge field for certain recording
configurations.

A key assumption of the constant recombination time model is the absence
of self-diffraction. Self-diffraction is the process by which the grating written at
the entrance of the photorefractive crystal diffracts a portion of the coherent
writing beams, modifying the interference pattern and hence the grating that
is recorded deeper in the crystal [7.13, 17, 37]. This process can be neglected for
sufficiently thin crystals and low diffraction efficiencies (e.g., 2 mm of bismuth
silicon oxide) and allows a considerable simplification of the analysis. If alternative
crystals with significantly higher electrooptic coefficients and/or thicker
crystals are used, then the diffraction efficiency would increase, self-diffraction
effects would be far more pronounced, and thus the framework
described in this chapter would need to be modified.

7.3.2  Perturbation  Series  Approximation 

By  recasting  the  chosen  set  of  photorefractive  equations  in  Fourier  transform 
space,  additional  physical  insights  into  the  transcription  process  can  be  achieved 
which  lead  naturally  to  a  perturbation  series  formulation  of  the  conversion 
process.  Analytic  solutions  can  be  derived  for  the  first  few  terms  of  this  series 
without  restricting  such  solutions  to  the  steady  state  regime,  and  so  define  an 
additional  approximate  model  of  the  PICOC  process.  Limiting  the  analysis  to 
the first few terms means that the strong nonlinearities that characterize the
incoherent-to-coherent optical conversion process are estimated at best, but the
most compelling advantage of this approach is its ability to model the temporal
evolution  of  the  space-charge  field.  Study  of  the  temporal  behavior  is  crucial 
to  the  analysis  of  the  grating  erasure  mode  (GEM)  and  the  grating  inhibition 
mode  (GIM).  In  this  chapter,  we  use  a  perturbation  model  that  predicts  the 
various  spatial  frequency  components  of  the  space-charge  field. 

Consider for example a coherent grating beam $I_G(x)$ of the form

$$I_G(x) = I_0\,(1 + m_G \cos K_G x), \tag{7.1}$$

in which $m_G$ is the modulation depth and $K_G$ is the wave vector associated
with the coherent grating, and an incoherent signal beam $I_S(x)$ of the form

$$I_S(x) = I_1\,(1 + m_S \cos K_S x), \tag{7.2}$$

in which $m_S$ is the modulation depth and $K_S$ is the wave vector associated with
the image profile. The coordinate system is defined such that $x$ is parallel to the
applied bias electric field $E_0$, which is also parallel to the coherent grating wave
vector $K_G$; $z$ is orthogonal to the entrance face of the photorefractive crystal;
$y$ is defined to complete a right-handed coordinate system. Because of the
nonlinearities in the recording process, the space-charge field $E(x)$ transcribed
by these light beams contains spatial frequencies that do not exist in the original
light profiles. These intermodulation terms have the form

$$E(x) = \sum_{m,n} E_{mn} \exp[\,i(mK_G + nK_S)x\,]. \tag{7.3}$$

If the recording process were perfectly linear, then only terms such as $E_{10}$ (i.e.,
$m = 1$ and $n = 0$) and $E_{01}$ ($m = 0$ and $n = 1$) would appear in which one
of the two subscripts is zero, assuming light profiles as defined by (7.1, 2). The
nonlinearity of the recording process induces new spatial harmonics such as
$E_{11}$ which describe the modulation of the coherent grating by the incoherent
image beam and hence are central to the photorefractive incoherent-to-coherent
optical conversion process.

The intermodulation decomposition (7.3) provides a natural framework
for a perturbation series analysis of the photorefractive incoherent-to-coherent
optical conversion process in powers of the modulation depths $m_G$ and $m_S$,
a technique first demonstrated by Kukhtarev et al. as applied to the analysis
of single grating transcription [7.18, 19], and extended by Ochoa et al. [7.20]
and by Refregier et al. [7.21]. In practice, a typical image consists of a multiplicity
of spatial frequency components, not just the single frequency signal
term postulated in (7.2). Such a spectrally rich image will necessarily introduce
an additional summation over the spectral harmonics in (7.3). The resultant
series, if limited to terms of the form $E_{11}$, predicts a linear transcription of the
image profile for which the response to each spatial frequency component can
be evaluated separately, such that the total response is determined by summing
up all harmonics. Higher order terms, such as $E_{12}$, describe nonlinear
distortion of the image spectrum in which new image frequencies are generated
that do not exist in the original incoherent image profile. These higher order
terms prove to be extremely tedious to calculate. If these higher order terms
contribute significantly to the conversion process, then alternative methods of
analysis such as numerical modeling are recommended. The analysis presented
in this chapter concentrates on the $E_{11}$ term.

Analytical expressions defining the temporal evolution of the first few harmonic
terms of (7.3) are readily derived. As shown in Appendix 7.A and in
[7.22], specific perturbation terms for the simultaneous erasure/writing mode
(SEWM) in the saturation limit can be approximated by

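$$E_{01} \approx -\tfrac{1}{2}\, m_S^{\rm eff} E_0, \tag{7.4}$$

$$E_{10} \approx -\tfrac{1}{2}\, m_G^{\rm eff} E_0, \quad \text{and} \tag{7.5}$$

$$E_{11} \approx \tfrac{1}{2}\, m_G^{\rm eff} m_S^{\rm eff} E_0, \tag{7.6}$$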

in which $E_0$ is the applied bias field shown in Fig. 7.8, and $m_G^{\rm eff}$ and $m_S^{\rm eff}$ are
effective modulation depths to be defined in the next section.

If bismuth silicon oxide is chosen as the photorefractive medium of interest,
then the refractive index modulation $\Delta n(x)$ induced along a principal
electrooptic axis of the crystal by the linear electrooptic effect is proportional to
the induced space-charge field $E(x)$ [7.23]. Each of the spatially periodic terms
in the space-charge field (7.3) thus induces a distinct volume phase grating
through the linear electrooptic effect. Such a superposition of phase gratings
diffracts an incident collimated readout laser beam into a multiplicity of discrete
beams, as shown in Fig. 7.10.

Another fundamental insight into the PICOC process that evolves naturally
from the perturbation series approach is the existence of a one-to-one
mapping of $E_{mn}$, the spatial frequency components of the space-charge field,
into $I_{mn}$, the diffraction orders and suborders of the spatially modulated readout
light. This identification presumes weak diffraction efficiencies, as have thus
far been typical of most PICOC implementations. In the far field diffraction
limit, shown in Fig. 7.10, the spatially modulated readout light profile $I_R(x)$
decomposes into the usual set of discrete diffraction orders associated with the
coherent grating, labeled by subscript $m$, while each of these orders further
decomposes into subharmonics associated with the incoherent image signal, labeled
by subscript $n$. This mapping offers a most convenient method to test
the behavior of distinct spatial frequency components of the space-charge field
$E_{mn}$ during the conversion process.

For typical coherent grating spatial frequencies and typical crystal thicknesses,
the volume phase gratings in the photorefractive crystal are recorded
deep within the Bragg regime. Therefore the optical readout process can respond
at most to one diffraction order such as $I_{10}$ and its immediate subharmonics
such as $I_{11}$, while excluding other diffraction orders and associated
sidebands such as $I_{01}$. Thus for the illumination profiles $I_G(x)$ and $I_S(x)$ of the
form (7.1, 2), and assuming selective diffraction into only the +1st diffraction
order and its immediate sidebands, a typical form of the modulated readout
light profile $I_R(x)$ would be

$$I_R(x) = I_{10} + I_{11} \cos(K_S x) + \text{higher image harmonics}. \tag{7.7}$$

[Fig. 7.10 caption, partially recovered: "... field into diffraction orders of the modulated readout light beam"]

Furthermore, assuming perfect Bragg alignment for the $I_{10}$ term, and assuming
image frequencies $K_S$ small enough to avoid Bragg misalignment of the immediately
adjacent subharmonics, the diffracted intensity terms $I_{10}$ and $I_{11}$ are
proportional to the squares of the corresponding space-charge field components
$E_{10}$ and $E_{11}$ for sufficiently low diffraction efficiencies. Combining (7.4-7) and
approximating for small modulation depths gives

$$I_R(x) \propto (m_G^{\rm eff})^2\,[\,1 - 4\, m_S^{\rm eff} \cos(K_S x)\,], \tag{7.8}$$

in which higher order terms have been neglected. Thus we can see from (7.8)
that the incoherent image has been transcribed onto the coherent readout beam
with a reversal of image contrast. The mapping of $E_{mn}$ into $I_{mn}$ also enables
the possibility of spatial filtering of the image beam to change the negative
image into a positive image (by a schlieren technique), and to improve the
contrast ratio without suffering an associated reduction in image intensity.

The perturbation series approach that leads to (7.8) is most effective whenever
the higher spatial harmonics in (7.7) can be neglected in favor of the
lowest harmonics, and this applies whenever the modulation depths $m_G^{\rm eff}$ and
$m_S^{\rm eff}$ are sufficiently small (or the spatial frequencies are sufficiently large).
Such small modulation depths occur only for very restrictive conditions which
seldom match the experimental conditions. Therefore the scope of the perturbation
solution is limited, which reflects as much a fundamental difficulty
with the PICOC process as it does a difficulty with the analytic technique.
The PICOC process requires a nonlinear response to generate the modulation
of the coherent grating by the incoherent image field, and hence strong levels
of nonlinear distortion inextricably occur with significant levels of modulated
light intensity. Restricting the attention to the lowest order spatial harmonics
underestimates the nonlinearities, but leads to simple and powerful analytical
expressions for the temporal response that contain considerable physical insight
into the PICOC process and are unobtainable by any other analytical
technique.

With  these  comments  as  backdrop,  we  can  now  proceed  with  a  more  de¬ 
tailed  study  of  the  recording  process  in  Sect.  7.4  and  of  the  readout  process  in 
Sect.  7.5. 


7.4  The  Recording  Process 

During the recording process, a space-charge electric field $E(x)$ is formed in the
photorefractive crystal in response to the combined illumination by a coherent
grating light beam $I_G(x)$ and an incoherent image beam $I_S(x)$. A physical
model describing this transcription process is reviewed in Sect. 7.4.1, and particular
numerical and analytical solutions are listed for the simultaneous erasure/writing
mode (SEWM). The implications of these results on the nonlinear
transfer function of the conversion process are explored in Sect. 7.4.2. Issues
affecting the resolution of the recording process are discussed in Sect. 7.4.3.
Finally, the temporal evolution characteristics are studied in Sect. 7.4.4.

7.4.1  Physical  Model  and  Sample  Solutions 

The model of photoconductivity that is most frequently selected to describe
holographic grating formation in photorefractive crystals assumes a single mobile
charge species (electrons) and a single trapping level [7.13], although more
intricate models involving hole transport [7.24, 25] and multiple trapping levels
[7.26] have occasionally been proposed to achieve a better fit with particular
experiments. The simple single trapping level/single mobile charge species
model which has been chosen to describe the PICOC recording process consists
of the following set of equations:

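$$\frac{\partial N_D^+}{\partial t} = [\,S_G I_G(x,t) + S_S I_S(x,t) + \beta\,](N_D - N_D^+) - \gamma_R N_D^+ n, \tag{7.9}$$

$$\frac{\partial n}{\partial t} = \frac{\partial N_D^+}{\partial t} + \frac{1}{e}\frac{\partial j}{\partial x}, \tag{7.10}$$

$$\frac{\partial E}{\partial x} = \frac{e}{\epsilon\epsilon_0}\,(N_D^+ - n - N_A^-), \tag{7.11}$$

$$j = e\mu n E + \mu k T\,\frac{\partial n}{\partial x}, \tag{7.12}$$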

in which

$N_D$ is the total concentration of donor-like trapping centers,

[symbol illegible] is the concentration of ionized donor-like trapping centers
in quasi-equilibrium under dark conditions,

$N_D^+(x,t)$ is the concentration of ionized donor-like trapping centers,

$N_A^-$ is the concentration of negatively charged acceptor-like
centers that compensate for the charge of the ionized donor-like centers under dark
thermal quasi-equilibrium conditions ($N_A^-$ is a constant of
the crystal),

$n(x,t)$ is the concentration of electrons in the conduction band,

$E(x,t)$ is the internal space-charge electric field,

$e$ is a positive number with magnitude equal to the electronic
charge,

$S_G$ is the cross section of photo-ionization for the coherent
grating beams with wavelength $\lambda_G$ divided by the photon
energy, hereinafter referred to as a photo-ionization cross
section,

$S_S$ is the cross section of photo-ionization for the incoherent
image beam with wavelength $\lambda_S$ divided by the photon
energy, hereinafter referred to as a photo-ionization cross
section,

$\beta$ is the thermal generation rate of electrons into the
conduction band,

$\gamma_R$ is the carrier recombination constant,

$I_G(x,t)$ is the optical intensity profile for the coherent grating,

$I_S(x,t)$ is the optical intensity profile for the incoherent image,

$j$ is the current density in the crystal,

$\mu$ is a positive number with magnitude equal to the charge
carrier mobility,

$k$ is Boltzmann’s constant,

$T$ is the absolute temperature of the crystal,

$\epsilon$ is the static dielectric constant of the crystal, and

$\epsilon_0$ is the free space electric permittivity.

Rationalized MKS units are assumed. The $x$ coordinate is transverse to the
nominal light propagation direction, and parallel to the applied bias field direction;
$t$ denotes time. Equation (7.9) is the rate equation describing the excitation
of electrons into the conduction band and subsequent recombination
into traps. Equation (7.10) states the conservation of electric charge. Equation
(7.11) is Maxwell’s first equation for the electric field. Equation (7.12) defines
the current density in terms of drift and diffusion components. The material
parameters for bismuth silicon oxide assumed in the numerical calculations are
listed in Table 7.1, taken from the works of Tanguay [7.27] and Valley and Klein
[7.28]. We wish to solve the equations for the space-charge electric field $E(x)$
which is induced by exposure to the input optical intensities $I_G(x)$ and $I_S(x)$.

Single Grating Response. Consider first the case of a single spatial frequency
grating $I_G(x)$, as defined by (7.1) of the previous section, recorded in the absence
of an incoherent image beam $I_S(x)$. Because of the nonlinearity of the
recording process, the key variables of the photoconductivity model consist of
a superposition of harmonics of the incident light beam, i.e.,


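$$n(x,t) = \sum_{m=-\infty}^{+\infty} n_m(t) \exp(imK_G x), \tag{7.13}$$

$$N_D^+(x,t) = \sum_{m=-\infty}^{+\infty} N_{Dm}^+(t) \exp(imK_G x), \tag{7.14}$$

$$E(x,t) = \sum_{m=-\infty}^{+\infty} E_m(t) \exp(imK_G x), \quad \text{and} \tag{7.15}$$

$$j(x,t) = \sum_{m=-\infty}^{+\infty} j_m(t) \exp(imK_G x). \tag{7.16}$$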

These  harmonic  decompositions  can  be  substituted  into  (7.9—12),  resulting  in 
a  set  of  coupled  differential  equations  defining  the  temporal  evolution  of  each 
harmonic component. This coupled set of equations can either be integrated
numerically, as demonstrated by Moharam et al. [7.16] on a reduced subset of
the  equations,  or  else  be  solved  approximately  by  perturbation  series  methods. 

Consider first a perturbation series expansion in powers of the modulation
depth $m_G$. Analytical expressions for the first order terms have been derived


Table 7.1. Material parameters of bismuth silicon oxide (Bi$_{12}$SiO$_{20}$)

Parameter                   Symbol        Value                  Reference
Mobility                    $\mu$         0.03 cm^2/Vs           [7.28]
Carrier lifetime            $\tau$        5 x 10^-6 s            [7.28]
Donor-like trap density     $N_D$         10^19 cm^-3            [7.28]
Dark ionized trap density   [illegible]   10^16 cm^-3            [7.28]
Recombination coefficient   $\gamma_R$    2 x 10^-11 cm^3/s      [7.28]
Dielectric constant         $\epsilon$    56                     [7.28]

Parameter                   Symbol        488 nm   515 nm   633 nm   Units        Reference
Index of refraction         $n_0$         2.650    2.615    2.530                 [7.27]
Optical absorption          $\alpha$      7.0      2.8      0.8      cm^-1        [7.27]
Electrooptic coefficient    $r_{41}$      4.53     4.51     4.41     pm/V         [7.27]
Photo-ionization cross      $S_G$                  0.42              cm^2/Joule   [7.28]
section (divided by
photon energy)

Note: A photo-ionization cross section $S_S$ at 488 nm has been estimated to be of order
1 cm^2/Joule by $S_S = (\alpha_S \lambda_S / \alpha_G \lambda_G)\, S_G$, assuming identical quantum efficiency at 488 and
515 nm. However, see the discussion in Sect. 7.4.3 and Sprague [7.30].


by Kukhtarev et al. [7.18, 19], such that in the steady state regime the first
spatial harmonic component $E_1$ of the space-charge field is given by


$$E_1 = -\frac{1}{2}\, m_G\, \frac{E_0 + iE_D(K_G)}{D(K_G)} \tag{7.17}$$


to first order in the modulation depth $m_G$, in which $E_0$ is the applied bias
electric field strength. In (7.17), the denominator function $D(K_G)$ is defined
by

$$D(K_G) = 1 - i\,\frac{E_0 + iE_D(K_G)}{E_q(K_G)}, \tag{7.18}$$


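and the diffusion field $E_D(K_G)$ and the trap-limited saturation field strength
$E_q(K_G)$ are defined by

$$E_D(K_G) = \frac{kTK_G}{e}, \tag{7.19}$$

$$E_q(K_G) = \frac{eN_A^-}{\epsilon\epsilon_0 K_G}, \tag{7.20}$$

in which, by charge compensation, $N_A^-$ equals the ionized donor-like trap density
under dark conditions.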


The trap-limited saturation field strength defines the highest electric field
that can be generated by a sinusoidal charge distribution whose maximum charge
density is set by the dark ionized trap density. For reduced values of this trap
density (and hence of $E_q$), the amount of space charge and resulting
space-charge field strength is limited as
described by (7.17, 18). Considerable variation has been reported in the literature
in estimates of the equilibrium donor-like trap density for bismuth
silicon oxide [7.29], which directly affects the estimate of the saturation field
strength $E_q$, and hence the upper limit on the space-charge field strength.
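
As a rough numerical illustration of the field scales discussed above, the following sketch evaluates the diffusion field, the saturation field, and the first order response for bismuth silicon oxide. It is not taken from the chapter: it assumes the standard forms $E_D = kTK_G/e$ and $E_q = eN/(\epsilon\epsilon_0 K_G)$, with $N$ taken as the dark ionized trap density of Table 7.1, a first order amplitude of $-\tfrac{1}{2} m_G (E_0 + iE_D)/D$, and a crystal temperature of 300 K. The function name, argument names, and default modulation depth are illustrative only.

```python
import math

K_B = 1.380649e-23        # Boltzmann constant, J/K
Q_E = 1.602176634e-19     # magnitude of the electronic charge, C
EPS_0 = 8.8541878128e-12  # free space permittivity, F/m


def first_order_response(cycles_per_mm, bias_v_per_cm, m_g=0.1,
                         temperature_k=300.0,   # assumed; not given in the text
                         n_dark_cm3=1e16,       # dark ionized trap density, Table 7.1
                         eps_r=56.0):           # static dielectric constant, Table 7.1
    """Return (E_D, E_q, D, E_1); fields in V/m, D dimensionless."""
    k_g = 2.0 * math.pi * cycles_per_mm * 1e3   # grating wave vector, rad/m
    e_0 = bias_v_per_cm * 1e2                   # applied bias field, V/m
    e_d = K_B * temperature_k * k_g / Q_E       # diffusion field (assumed form)
    e_q = Q_E * (n_dark_cm3 * 1e6) / (eps_r * EPS_0 * k_g)  # saturation field (assumed form)
    d = 1.0 - 1j * (e_0 + 1j * e_d) / e_q       # denominator function
    e_1 = -0.5 * m_g * (e_0 + 1j * e_d) / d     # first harmonic, first order in m_g
    return e_d, e_q, d, e_1


e_d, e_q, d, e_1 = first_order_response(cycles_per_mm=200.0, bias_v_per_cm=2000.0)
print(f"E_D = {e_d / 100:.0f} V/cm, E_q = {e_q / 1e5:.1f} kV/cm, |D| = {abs(d):.3f}")
```

Under these assumptions, a 200 cycles/mm grating with a 2 kV/cm bias gives a diffusion field of a few hundred V/cm, a saturation field of a few tens of kV/cm, and a denominator magnitude within a few percent of unity. Because $E_q$ scales directly with the assumed trap density, the result inherits the uncertainty in that parameter noted above.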

The first order perturbation analysis is accurate only in the limit of low
modulation depths $m_G$, whereas many gratings are written with the highest
possible modulation depths. Solutions accurate at higher modulation depths
can be obtained by numerical methods, with sample solutions presented in
Fig. 7.11 showing the space-charge field profiles induced by a single grating frequency
in the steady state regime for various applied bias fields and various
grating frequencies. A modulation depth of $m_G = 0.99$ and the material parameters
listed in Table 7.1 are assumed for all curves. Note that even when
the writing light profile is cosinusoidal, the resulting space-charge field profiles
exhibit significant distortion because of the nonlinearity of the recording
process.

In practice, only one of the multiple spatial harmonics of the coherent
grating wave vector $K_G$ can dominate the optical readout process because the
grating is typically recorded very deeply into the Bragg regime. Figure 7.12
therefore shows numerical solutions for the strength of just the first spatial
harmonic component of the space-charge field as a function of the modulation
depth $m_G$ (labeled in the figure as $m_G^{\rm eff}$ in anticipation of the two grating
transcription discussion that follows, but to be interpreted for now as $m_G$).
The family of curves shown in Fig. 7.12 is parametrized by the ratio $E_q/E_0$,
which scales the relative magnitudes of the trap-limited saturation field to the
applied bias field. Those portions of the response curves in Fig. 7.12 that can
be approximated by a straight line, namely for low levels of the modulation
depth $m_G$ up to about 0.5 or so, are the regions in which the linear approximation
of Kukhtarev et al. [7.18, 19] most accurately describes the transcription
process. In anticipation of the discussion of nonlinear PICOC response given
in Sect. 7.4.2, we find that high effective modulation depths $m_G^{\rm eff}$ (defined by
(7.23) below) for which the linear approximation breaks down are associated
with weak levels of average incoherent image intensity $I_1$ compared with the
average coherent grating light intensity $I_0$, while low modulation depths for
which the linear approximation is most accurate occur for strong levels of average
incoherent light illumination $I_1$.

[Figs. 7.11 and 7.12. Recovered axis labels: internal electric field (volts/cm); first harmonic component of the space-charge field ($E_1/E_0$).]


Two Grating Transcription. When an incoherent image beam is present simultaneously
with the coherent grating, as in the simultaneous erasure/writing
mode (SEWM), then additional terms corresponding to harmonics of the image
frequencies must be included in the space-charge field. Consider for example an
image consisting of just one sinusoidal component, as given by (7.2), with a resulting
space-charge field decomposition of the form of (7.3). Double harmonic
decompositions analogous to (7.3) can be assumed for each of the variables in
the photoconductive equations (i.e., the ionized trap density $N_D^+$, the electron
density $n$, and the current density $j$).

The resulting set of coupled differential equations can be solved either
numerically or by a perturbation series analysis with respect to this harmonic
decomposition, with the latter being the approach adopted herein [7.22]. The
perturbation series approach leads to analytical expressions for the linear terms
$E_{10}$ and $E_{01}$

$$E_{10} = -\frac{1}{2}\, m_G\, \frac{S_G I_0}{S_G I_0 + S_S I_1}\, \frac{E_0 + iE_D(K_G)}{D(K_G)}, \tag{7.21}$$

$$E_{01} = -\frac{1}{2}\, m_S\, \frac{S_S I_1}{S_G I_0 + S_S I_1}\, \frac{E_0 + iE_D(K_S)}{D(K_S)}, \tag{7.22}$$


as given in Appendix 7.A and detailed in [7.22]. These expressions correspond
very closely to (7.17), the first order field expression derived by Kukhtarev et
al. [7.18, 19]. The effective modulation depths $m_G^{\rm eff}$ and $m_S^{\rm eff}$ in (7.21, 22), which
account for the reduction of the original modulation depths $m_G$ and $m_S$ by
the presence of both the incoherent image and the coherent grating beams, are
defined by

$$m_G^{\rm eff} = m_G\, \frac{S_G I_0}{S_G I_0 + S_S I_1}, \tag{7.23}$$

$$m_S^{\rm eff} = m_S\, \frac{S_S I_1}{S_G I_0 + S_S I_1}. \tag{7.24}$$


The expressions (7.21, 22) can be simplified over broad operating regions as
follows. For spatial frequencies less than of order 200 cycles/mm and bias fields
over 2 kV/cm, and assuming the equilibrium donor-like trap density given
in Table 7.1, the denominator terms $D(K_G)$ and $D(K_S)$ can be approximated
by unity, and the diffusion field $E_D$ can be neglected in favor of the applied
bias field. For higher spatial frequencies, the denominator factors $D(K_G)$ and
$D(K_S)$ and the diffusion field $E_D$ primarily contribute a phase shift to the
coherent grating’s charge distribution, with negligible degradation of its magnitude.
(This assertion is justified in the discussion on material limitations in
Sect. 7.4.3.) Thus the lowest harmonics of the space-charge field can be approximated
by

$$E_{10} = -\tfrac{1}{2}\, m_G^{\rm eff} E_0 \tag{7.25}$$

$$E_{01} = -\tfrac{1}{2}\, m_S^{\rm eff} E_0. \tag{7.26}$$

By invoking similar approximations, the magnitude of the intermodulation
term $E_{11}$, which is crucial to the incoherent-to-coherent optical conversion
process, is well approximated by

$$E_{11} = \tfrac{1}{2}\, m_G^{\rm eff} m_S^{\rm eff} E_0, \tag{7.27}$$

as discussed in Sect. 7.4.3 and in [7.22]. An identical expression for $E_{11}$ was
derived by Marrakchi et al. [7.8], starting from the constant recombination time
approximation discussed in Sect. 7.3.1. This derivation involves an additional
linearization of the response in the limit of low modulation depths $m_G^{\rm eff}$ and
$m_S^{\rm eff}$.

In addition to the magnitude expressed by (7.27), the perturbation series
analysis predicts that the $E_{11}$ field is phase shifted with respect to the incident
coherent grating. For steady state response in SEWM, this phase shift does not
significantly impact the performance of the conversion process, assuming that
it remains reasonably constant over the recording bandwidth; for temporal
response it can significantly degrade the usefulness of PICOC for particular
optical processing architectures.

Equations (7.25-27) indicate that the SEWM response in the steady state
limit is predominantly governed by the reduced modulation depths $m_G^{\rm eff}$ and
$m_S^{\rm eff}$. The consequent impact on the overall readout image light intensity and
on the modulation transfer characteristic is discussed in the nonlinear transfer
response analysis, presented next.
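
The origin of the intermodulation term can be illustrated numerically with a deliberately simplified steady state picture. The sketch below is not the chapter's model: it assumes that the free carrier density simply follows the local illumination and that the current is drift dominated and uniform, so that the space-charge field varies as the reciprocal of the total illumination with its spatial average fixed at the applied field $E_0$. It then extracts the harmonic coefficients with a discrete Fourier transform and compares them with the lowest order estimates $-\tfrac{1}{2} m_G^{\rm eff} E_0$, $-\tfrac{1}{2} m_S^{\rm eff} E_0$, and $\tfrac{1}{2} m_G^{\rm eff} m_S^{\rm eff} E_0$. The function name and the integer period counts `p_g` and `p_s` are illustrative.

```python
import numpy as np


def sewm_harmonics(m_g_eff, m_s_eff, p_g=40, p_s=3, n_samples=4096):
    """Harmonic coefficients E_mn / E_0 for a field proportional to 1 / illumination.

    p_g and p_s are the (integer) numbers of grating and image periods inside
    the unit-length window, so each harmonic falls on an exact FFT bin.
    """
    x = np.arange(n_samples) / n_samples
    k_g = 2.0 * np.pi * p_g
    k_s = 2.0 * np.pi * p_s
    # Total illumination normalized to its mean, written with effective depths.
    illumination = 1.0 + m_g_eff * np.cos(k_g * x) + m_s_eff * np.cos(k_s * x)
    field = 1.0 / illumination      # assumed drift-dominated steady state
    field /= field.mean()           # spatial average equals the applied field E_0
    coeffs = np.fft.fft(field) / n_samples
    return {"E10": coeffs[p_g], "E01": coeffs[p_s], "E11": coeffs[p_g + p_s]}


harmonics = sewm_harmonics(m_g_eff=0.05, m_s_eff=0.04)
estimates = {"E10": -0.5 * 0.05, "E01": -0.5 * 0.04, "E11": 0.5 * 0.05 * 0.04}
for name, value in harmonics.items():
    print(f"{name}/E_0: numerical {value.real:+.6f}, lowest order {estimates[name]:+.6f}")
```

For small effective modulation depths the numerical coefficients agree with the lowest order estimates to within a fraction of a percent, and the agreement degrades as the depths increase, which is the regime where higher order terms such as $E_{12}$ become significant.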


7.4.2  Nonlinear  Transfer  Response 

The image transfer for PICOC involves a performance trade-off between two
competing mechanisms: the contrast ratio (which involves the ratio of $I_{11}$ and
$I_{10}$) improves steadily with increasing intensities of incoherent image-bearing
light (in the absence of spatial filtering), but at the same time the average
intensity (which involves $I_{10}$ alone) steadily declines with increasing incoherent
intensity because the uniform background in the incoherent light erases the
carrier grating pattern in the photorefractive crystal.

In many optical information processing applications, optimization of the
image contrast ratio is desirable within the image intensity constraints implied
by the performance trade-off described above. In other types of signal processing
applications such as correlation with a Vander Lugt filter, the dc image
content contained in the $I_{10}$ diffraction component does not contribute to the
processing, or can be readily modified by spatial filtering, whereas maximizing
the intensity of nonzero spatial frequency components such as $I_{11}$ is critical to
good conversion performance. For these cases, an optimum level of incoherent
image-bearing beam intensity exists for maximizing the $I_{11}$ component, beyond
which the $I_{11}$ component decays because of space-charge erasure. Therefore the
following study of the nonlinear transfer response includes explicit consideration
of the $I_{11}$ diffraction order term.

To obtain a quantitative estimate of these effects, consider as a
representative example the simultaneous erasure/writing mode (SEWM) in the
steady state regime, with a single spatial frequency coherent grating given by
(7.1), and a single spatial frequency incoherent image profile given by (7.2).
One possible method of assessing the image transfer response is to determine
the modulation depth of the readout image as a function of the input
(incoherent and coherent) light characteristics. A convenient parameter that
describes these characteristics is the product BR, in which R = I_1/I_0 is the
ratio of the average incoherent image light to the average coherent grating
light intensity levels, and B = s_S/s_G is the ratio of the photoconductive
sensitivity of the incoherent image beam to that of the coherent grating beam.
With these definitions, the effective modulation depths m_G^eff and m_S^eff
defined by (7.25, 7.26) can be expressed as


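$$ m_G^{\mathrm{eff}} = \frac{m_G}{1 + BR} \tag{7.28} $$

$$ m_S^{\mathrm{eff}} = \frac{m_S BR}{1 + BR} \tag{7.29} $$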


The diffracted intensity I_10 is directly proportional to the square of
m_G^eff, while the modulation depth of the readout image is similarly
proportional to m_S^eff. The image transfer response is plotted in Fig. 7.13,
which shows the improvement in modulation depth of the readout light beam with
increasing levels of incoherent image illumination, but with concomitant decay
of the average transferred image intensity I_10.

Fig. 7.13. Normalized diffraction efficiency of the I_10 beam (uniform
erasure), and the modulation transfer function for modulated erasure, as a
function of the intensity ratio R

An unusual but intuitive feature of the photorefractive incoherent-to-coherent
conversion process (as opposed to more typical linear spatial light modulation
techniques) is that the image quality is primarily determined by the ratio of
the incoherent image intensity to the coherent grating intensity, not by the
sum of these intensities (assuming negligible dark conductivity β in the
photogeneration rate equation (7.9)). Remember, however, that this analysis
assumes a recording process that has already reached steady state, and hence
does not address the question of the time required to reach saturation, which
is indeed a function of the total light intensity. Temporal response issues
are considered in Sect. 7.4.4 below.

To test the predictions of this nonlinear transfer model, the erasure of the
readout beam’s first diffraction order I_10 in response to spatially uniform
incoherent illumination has been measured by the nondegenerate four wave
mixing configuration discussed in Sect. 7.2, with results as shown in
Fig. 7.14. The grating was written with the 515 nm line of an argon ion laser,
while the 488 nm line was used to simulate the spatially uniform image beam.
For comparison, predictions of two different models of the conversion process
are included in Fig. 7.14. The curve labeled “linear approximation”
corresponds to the perturbation analysis term E_10 presented in this chapter,
while the second curve labeled “constant recombination time approximation”
derives from (7.9) of the previously published model [7.8], which in turn is
based upon the saturation regime model of Moharam et al. [7.16]. The linear
approximation model has been scaled in intensity to match the experimental
points at R = 0.6, and the constant recombination time approximation
(Moharam’s analytical solution) has been scaled to converge with the linear
approximation for very large beam ratios R.


Fig. 7.14. Experimental diffraction efficiency of the I_10 beam (uniform
erasure) as a function of the intensity ratio R. Corresponding theoretical
predictions for the linearized model and the constant recombination time model
are also shown for comparison


Note that the experimental data points and both models converge reasonably
well for high levels of incoherent image illumination, which correspond to
large intensity ratios R that by (7.28) imply small effective modulation
depths m_G^eff. Recall from Fig. 7.12 that small modulation depths correspond
to the most accurate region of the linearized models of the recording process,
as assumed in the lowest order terms of the perturbation analysis (as well as
in the linearized constant recombination time approximation discussed
following (7.27), which was shown to yield expressions identical to the
perturbation analysis). In contrast, for small intensity ratios R with
concomitantly large effective modulation depths m_G^eff, the full nonlinear
constant recombination time model as developed in [7.8] offers a significantly
better recording approximation than the linearized models for the simultaneous
erasure/writing mode (SEWM) in saturation, as shown clearly in Fig. 7.14 by
the curve marked “constant recombination time approximation”. Note in addition
that the choice of scaling of the constant recombination time approximation
does not automatically guarantee good agreement with the data near R = 0.

A more challenging test of the theory is to predict accurately the conversion
response to a sinusoidally modulated image beam. Such a test can be performed
with the nondegenerate four wave mixing geometry described previously in which
a 488 nm argon ion laser line passes through a Michelson interferometer to
generate a sinusoidal spatial modulation, as shown in Fig. 7.15. The
sinusoidal image modulation introduces an additional diffraction order I_11
not observed in the uniform erasure case. The diffracted light sideband
intensity I_11 has been measured as a function of the intensity ratio R, with
results as shown in Fig. 7.16, and as compared against the theoretical
prediction of the perturbation expansion method.

Fig. 7.15. Experimental arrangement for sensitivity and transfer function
measurements, as described in detail in the text

Fig. 7.16. Normalized experimental diffraction efficiency of the I_11 beam
(strongly modulated erasure) as a function of the intensity ratio R. The
corresponding theoretical prediction is also shown for comparison

The general features of Fig. 7.16 can be understood with reference to
Fig. 7.13. For low levels of image intensity, corresponding to low intensity
ratios R, the coherent grating’s charge pattern is not significantly erased by
the image beam, as indicated by the high level of I_10 in Fig. 7.13. In this
regime, increasing the incoherent image-bearing light intensity increases the
transfer of image modulation onto the space-charge grating profile without
destroying that profile. For high levels of incoherent light intensity,
corresponding to larger intensity ratios R, the transfer of image modulation
onto the coherent grating’s space-charge field profile is very high as shown
by the readout beam’s modulation depth curve in Fig. 7.13, but the high
intensity of the uniform incoherent image light I_1 strongly erases the
coherent grating’s space-charge profile, as shown by the I_10 curve in
Fig. 7.13. Hence the I_11 diffracted light component, which is derived from
the combination of the modulation depth and the average grating intensity
I_10, exhibits the peaking behavior shown in Fig. 7.16.
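
The peaking behavior can be illustrated with a short numerical sketch. It
assumes that the effective modulation depths take the form m_G/(1 + BR) and
m_S BR/(1 + BR), and that the sideband intensity scales as the square of their
product (I_10 times the square of the readout modulation depth). The function
names and the scan range are illustrative choices, not part of the original
analysis.

```python
def effective_modulation_depths(m_g, m_s, b, r):
    """Return (m_G_eff, m_S_eff) for sensitivity ratio b and intensity ratio r."""
    br = b * r
    return m_g / (1.0 + br), m_s * br / (1.0 + br)


def relative_i11(b, r, m_g=1.0, m_s=1.0):
    """Relative sideband intensity, ASSUMING I_11 ~ (m_G_eff * m_S_eff)**2."""
    m_g_eff, m_s_eff = effective_modulation_depths(m_g, m_s, b, r)
    return (m_g_eff * m_s_eff) ** 2


if __name__ == "__main__":
    b = 2.0  # nominal sensitivity ratio quoted in the text
    ratios = [0.01 * k for k in range(1, 501)]  # scan R from 0.01 to 5.0
    best_r = max(ratios, key=lambda r: relative_i11(b, r))
    print(f"I_11 is largest near R = {best_r:.2f} for B = {b}")
```

Under these assumptions the maximum falls where BR = 1, so a larger
sensitivity ratio B moves the optimum toward smaller intensity ratios R.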

The match between the theoretical curve in Fig. 7.16 and the experiment is not
expected to be perfect because the perturbation theory requires small
modulation depths m_G and m_S for reasonable accuracy, whereas the experiment
is configured with the highest possible modulation depths. To impose small
modulation depths in the experiment would make the diffracted I_11 light
intensity too weak to measure reliably, so a more exacting test of the theory
must await numerical modeling of high modulation depth transcriptions.

An additional difficulty in achieving agreement between the theoretical models
and the experiment concerns the appropriate value for the photoconductive
sensitivity ratio B. For the experiment described in Fig. 7.14, in which the
coherent writing wavelength is 515 nm, and the incoherent image wavelength is
488 nm, an estimate of B ≈ 2 can be derived from a single photon absorption
model that assumes the quantum efficiencies of the photoconductive processes
for both the coherent grating beam and the incoherent image beam to be
identical [7.8]. However, measurements of the photoconductive quantum
efficiency reported by Sprague [7.30] show a strong dependence on the light
wavelength, such that an increase in the sensitivity ratio B by a factor of
1.5 can reasonably be argued due to this dispersion of quantum efficiency.
Furthermore, this factor can be expected to vary from one crystal sample to
another, depending upon the detailed growth conditions. On the other hand, a
reduction of the effective sensitivity ratio B by a factor of as much as 1.7
can also be argued for the crystal thickness used in this experiment because
of the dispersion of the optical absorption, i.e., the ratio of coherent
grating to incoherent image beam intensities changes continuously as both
beams propagate through the crystal [7.8]. Because of these conflicting
arguments, the nominal ratio of B = 2 in Figs. 7.14 and 7.16 has been assumed
for the wavelengths considered. This gives a good fit for the uniform erasure
I_10 experimental points, especially for the constant recombination time
approximation, but this ratio causes the predicted modulated erasure I_11
curve to peak at a significantly lower intensity ratio R than is indicated by
experiment.

In summary, the broad features of the recording nonlinearities are well
understood and successfully modeled by the perturbation series approach, e.g.,
the decay of the coherent grating pattern with increasing amounts of image
intensity, and the dependence of the I_11 diffracted light intensity component
on exposure parameters. Detailed agreement will require both further analysis
to model the nonlinearities more accurately, and better information about the
wavelength dispersion of the photoconductive sensitivities of the coherent
grating and the image-bearing beams.

7.4.3  Spatial  Resolution  Issues  for  the  Recording  Process 

A  number  of  distinct  factors  influence  the  ultimate  resolution  achievable  with 
the  PICOC  spatial  light  modulator.  These  factors  can  be  classified  as  geometric, 
configurational,  and  materials  related  in  nature.  The  geometric  and  materials 
related  factors  influence  the  recording  of  the  image,  and  hence  are  reviewed  in 
this  section.  The  configurational  factors  influence  the  readout  of  the  volume 
gratings,  and  as  such  are  reviewed  in  the  subsequent  section. 

Geometrical Limitations. Geometric resolution limitations derive principally
from the incorporation of an incoherent imaging system in the four-wave mixing
geometry, and from the finite crystal thickness d required to create a volume
holographic grating. These effects are illustrated in Figs. 7.17 and 7.18.
Distinctly different resolution performance is expected, depending upon the
optical absorption coefficients α_G of the coherent grating light and α_S of
the incoherent image light.

Figure 7.17 describes the case for low optical absorption (α_G d ≪ 1 and
α_S d ≪ 1), such that the induced holographic grating has essentially uniform
amplitude throughout the volume of the crystal. As can be expected from
physical considerations, the optimum focal point occurs in the center of the
crystal, and is not localized on the front surface of the crystal. The spatial
frequency response will then be proportional to (W/2)^-1, which in turn is
equal to 4 n_0 F#/d for the case of 1:1 imaging, in which W is the diameter of
the incoherent image beam at the front surface of the crystal (as shown in
Fig. 7.17), n_0 is the refractive index of the electrooptic crystal, F# is the
F-number of the incoherent imaging system, and d is the crystal thickness. For
example, for n_0 = 2.5, d = 1 mm, and an F-number of 5, the resolution limit
is approximately 50 cycles/mm.

Fig. 7.17. Geometrical constraints imposed on PICOC resolution by the finite
F-number of the image input optics for the case αd ≪ 1. A distance of 2f
between the imaging lens and the photorefractive crystal has been assumed in
this and in the next figure, corresponding to a presumed 1:1 lens
magnification

Fig. 7.18. Geometrical constraints imposed on PICOC resolution by the finite
F-number of the image input optics for the case αd ≫ 1

In contrast, Fig. 7.18 describes the case for which α_G d ≫ 1, such that the
induced holographic grating has significant amplitude only within a thin layer
of thickness d_eff ≈ α_G^-1. In this case, the resolution is given by
4 n_0 F# α_G. For n_0 = 2.5, d = 1 mm, α_G = 100 cm^-1, and an F-number of 5,
the resolution limit is approximately 500 cycles/mm, a factor of 10
improvement in resolution. However, the diffraction efficiency is reduced by a
factor of order 100 because of the reduction in effective thickness of the
grating. If the absorption coefficient α_S is chosen to be significantly
larger than α_G, then the resolution will be constrained by α_S instead of
α_G.
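
The two geometrical limits can be checked with a few lines of arithmetic. The
sketch below evaluates 4 n_0 F#/d and 4 n_0 F# α for the numbers quoted above.
The function names are illustrative, and the efficiency penalty assumes that
the diffraction efficiency of a weak grating scales as the square of its
effective thickness, which is an assumption made here rather than a result
stated in the text.

```python
def resolution_low_absorption(n0, f_number, thickness_mm):
    """Resolution limit 4*n0*F#/d in cycles/mm (weakly absorbing crystal)."""
    return 4.0 * n0 * f_number / thickness_mm


def resolution_high_absorption(n0, f_number, alpha_per_cm):
    """Resolution limit 4*n0*F#*alpha in cycles/mm (strongly absorbing crystal)."""
    alpha_per_mm = alpha_per_cm / 10.0  # 1 cm^-1 = 0.1 mm^-1
    return 4.0 * n0 * f_number * alpha_per_mm


def efficiency_penalty(thickness_mm, alpha_per_cm):
    """Efficiency reduction, ASSUMING efficiency ~ (effective thickness)**2."""
    effective_thickness_mm = 10.0 / alpha_per_cm  # d_eff ~ 1/alpha, in mm
    return (thickness_mm / effective_thickness_mm) ** 2


if __name__ == "__main__":
    print(resolution_low_absorption(2.5, 5.0, 1.0), "cycles/mm")
    print(resolution_high_absorption(2.5, 5.0, 100.0), "cycles/mm")
    print(efficiency_penalty(1.0, 100.0), "x lower diffraction efficiency")
```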

Material Limitations. An additional resolution limitation stems from
material-dependent parameter constraints which influence the physics of
grating formation, in particular the finite supply of compensating traps,
which is equal to the equilibrium donor-like trap density. If the trap density
is limited, then the space-charge field that can be recorded is similarly
limited because sufficient space charge cannot be generated to establish any
higher field strengths. This limitation becomes progressively more severe at
higher spatial frequencies as expressed by the D(K_G) factor in (7.17), in
which the trap-limited saturation space-charge field E_q is a function of the
spatial frequency and the equilibrium donor-like trap density as shown in
(7.20). The reduction in space-charge field with an increase in spatial
frequency (as exhibited by a corresponding decrease in E_q) is shown in
Fig. 7.12. If an unlimited supply of compensating traps were available for
establishing the space-charge field, then the saturation field would be
infinite and the factor D(K_G) would converge to unity, indicating negligible
degradation in the space-charge field. In practice, the finite level of
compensating traps leads both to a reduction of the space-charge field and to
a phase shift as consequences of the finite saturation field. These two
effects are discussed in more detail below.

Consider for example an equilibrium donor-like trap density of 10^16 cm^-3 and
a coherent grating frequency of 300 cycles/mm, which implies a saturation
field E_q of order 19 kV/cm and a diffusion field E_D of order 0.5 kV/cm. For
an applied bias field of 6 kV/cm, the effect of the factor D(K_G) in (7.17) is
only a 5 % reduction in the magnitude of the induced space-charge field in the
linear approximation. As these parameters are typical of PICOC operation, the
resultant effect induced by this mechanism on the spatial frequency response
is therefore negligible compared with alternative response degradation
mechanisms such as Bragg detuning on readout, as discussed in the next
section.
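
The order of magnitude of these two fields can be reproduced with a brief
sketch. It uses the standard band-transport expressions E_D = K k_B T/e and
E_q = e N/(ε_r ε_0 K), which are assumed here to correspond to the definitions
referenced in the text. The relative permittivity of 56 and the temperature of
300 K are assumed typical values for bismuth silicon oxide, not values taken
from Table 7.1.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge, C
K_BOLTZMANN = 1.380649e-23   # Boltzmann constant, J/K
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m


def grating_wave_number(cycles_per_mm):
    """Grating wave number K in rad/m from a spatial frequency in cycles/mm."""
    return 2.0 * math.pi * cycles_per_mm * 1.0e3


def diffusion_field(k, temperature=300.0):
    """Diffusion field E_D = K k_B T / e in V/m (temperature is an assumption)."""
    return k * K_BOLTZMANN * temperature / E_CHARGE


def saturation_field(k, trap_density_cm3, eps_r):
    """Trap-limited field E_q = e N / (eps_r eps0 K) in V/m, N in cm^-3."""
    n_m3 = trap_density_cm3 * 1.0e6  # convert cm^-3 to m^-3
    return E_CHARGE * n_m3 / (eps_r * EPS0 * k)


if __name__ == "__main__":
    k = grating_wave_number(300.0)
    # 1 kV/cm = 1e5 V/m
    print(f"E_D = {diffusion_field(k) / 1.0e5:.2f} kV/cm")
    print(f"E_q = {saturation_field(k, 1.0e16, 56.0) / 1.0e5:.1f} kV/cm")
```

With the assumed permittivity the saturation field comes out near 17 kV/cm,
the same order as the 19 kV/cm quoted above; the exact figure depends on the
permittivity actually used.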

The  addition  of  harmonic  components  to  the  image  will  in  general  intro¬ 
duce  an  additional  amplitude  variation  in  the  space-charge  field  through  the 
denominator  factor  D(K)  in  (7.17)  (as  shown  explicitly  in  (7.A4)  of  Appendix 
7.A).  This  variation  is  negligible  compared  with  the  effect  induced  by  the  co¬ 
herent  grating  spatial  frequency  described  above. 

To verify the predicted high bandwidth of the recording process and to
eliminate depth of focus issues as discussed previously, the Michelson
interferometer configuration shown in Fig. 7.15 was used to record sinusoidal
image patterns onto a bismuth silicon oxide crystal. The ratio R of the
image-bearing light intensity I_1 to the coherent grating light intensity I_0
was adjusted experimentally to maximize the intensity of the I_11 diffraction
order. As the image grating frequency was varied, the angular alignment of the
readout beam was also varied to maintain optimum Bragg angle alignment,
thereby removing Bragg detuning effects on the readout resolution. Thus the
measured diffracted intensity I_11 corresponds to the strength of the
space-charge field stored in the crystal, with results as shown in Fig. 7.19.
A slight decrease in diffraction efficiency with increasing spatial frequency
is observed, but the effective bandwidth for grating writing far exceeds the
expected bandwidths for Bragg detuning on readout, as discussed in the readout
section below.

Fig. 7.19. Measured resolution response of the photorefractive recording
process, limited by material dependent constraints. These measurements were
performed by continuously Bragg matching the I_11 order, thereby removing any
dependence of bandwidth on Bragg detuning, as discussed in Sect. 7.5

A far more serious implication of (7.17) for the space-charge field component
E_1 is a shift in the phase of the recorded space-charge field profile with
respect to the incident coherent grating illumination profile. Considering
again an equilibrium donor-like trap density of 10^16 cm^-3, a coherent
grating frequency of 300 cycles/mm, and an applied bias field of 6 kV/cm, the
resultant phase shift is of order 18°. This phase shift poses no problems for
PICOC performance so long as it remains uniform over the full aperture of the
photorefractive crystal and the full bandwidth, and in fact it varies only
weakly with K_G and m_G. On the other hand, an image-induced differential
phase shift can prove to be problematic for particular optical processing
architectures.

Resolution Anisotropy. A final resolution issue that has been predicted by the
perturbation series is a moderate anisotropy in the recording of image
structures parallel as opposed to perpendicular to the applied bias field for
the simultaneous erasure/writing mode (SEWM), and a severe resolution
anisotropy for the grating inhibition mode (GIM). Features of this analysis
are quite intriguing and hence are briefly reviewed here.

The perturbation series analysis is conducive to a physically intuitive
interpretation of the perturbation terms as a sequence of discrete recording
events. For the simultaneous erasure/writing mode (SEWM), two recording paths
contribute to the I_11 diffraction order. In one recording path, the
incoherent image writes a space-charge field E_01, independent of the coherent
grating profile. This space-charge field E_01 then modulates the recording of
the coherent grating light beam. This transcription path is analogous to the
grating inhibition mode (GIM). In the second SEWM transcription path, the
coherent grating profile writes a space-charge field E_10, which then
modulates the recording of the incoherent image-bearing beam. This second path
is analogous to the grating erasure mode (GEM). The GEM-like path exhibits
nearly perfect isotropy of response to an arbitrary image, but the GIM-like
path exhibits a very strong anisotropy, with image structures oriented
perpendicular to the applied bias field generating much weaker space-charge
fields than structures oriented parallel to the bias field.

To elaborate, consider an image profile consisting of a single spatial
frequency, similar to (7.2), but oriented such that the wave vector K_S is
orthogonal to the applied bias field E_0. Thus the incident image intensity
I_S(y) is given by

$$ I_S(y) = I_1 (1 + m_S \cos K_S y) , \tag{7.30} $$

in which the y axis is orthogonal to the applied bias field E_0. This image
profile, in combination with the coherent grating profile as given by (7.1),
induces a space-charge field E(x,y) of the form

$$ \mathbf{E}(x,y) = \hat{x} E_{10} \cos K_G x + \hat{y} E_{01} \cos K_S y + (\hat{x} E_{11x} + \hat{y} E_{11y}) \cos K_G x \cos K_S y + \text{higher order terms} \tag{7.31} $$

in which x̂ and ŷ are unit vectors parallel and perpendicular to the applied
bias field respectively.

Terms such as E_01 and E_11y involve recording a charge pattern along a
direction orthogonal to the applied bias field, and hence do not benefit from
the enhancement of the photoconductivity induced by this applied field. In
practice, these terms are very much smaller than terms such as E_10 and E_11x
which involve recording a charge pattern along a direction parallel to the
applied bias field. Analysis using the perturbation method shows that the
E_11x term is reduced by a factor of two compared with the E_11 term given by
(7.5) that results when the image wave vector K_S is parallel to the applied
bias field E_0.

Thus we find that the material limitations on recording resolution are
negligible compared with the geometrical limitations, which in turn are of
comparable magnitude to the Bragg sensitivities on readout to be discussed in
Sect. 7.5.

7.4.4  Temporal  Response 

A temporal response analysis is necessary for the study of the grating
inhibition mode (GIM) and the grating erasure mode (GEM) since these involve
temporal sequencing, and the issue of timing is crucial to the optimisation of
their performance. The temporal analysis is also of interest for the
simultaneous erasure/writing mode (SEWM) because it clarifies the duration and
nature of the transient writing period before a stable response is achieved,
and also because it leads to a very important reciprocity law between the
incident light power and the response time of the converter (assuming that the
dark conductivity of the photorefractive crystal can be neglected). A final
and very important issue that arises from the temporal response is an
image-induced phase modulation of the coherent grating’s charge profile that
can degrade PICOC performance in some coherent processing architectures.

The analysis reported here is restricted to small modulation regimes for
simplicity. The results are modified substantially when operating with large
modulation depths. In particular, the transient period increases significantly
in such a regime.

To introduce some of the key concepts, consider the temporal evolution of the
space-charge field in response to coherent grating illumination in the absence
of an incoherent image. Kukhtarev’s study of this problem using first order
perturbation analysis [7.19] (in the absence of self-diffraction effects) has
shown that if the photorefractive crystal is illuminated with an intensity

$$ I(x,t) = \begin{cases} 0 & \text{for } t < 0 \\ I_0 (1 + m_G \cos K_G x) & \text{for } t > 0 \end{cases} \tag{7.32} $$

then the space-charge field component E_1(t) has the form

$$ E_1(t) = m_G [E_0 + iE_D(K_G)] D(K_G)^{-1} \{1 - \exp[-t/T(K_G)]\} \tag{7.33} $$

in which D(K_G), E_D(K_G), and E_q(K_G) have been defined by (7.18-20). The
time constant T(K) is defined by

$$ T(K) = T_0 \frac{1 - i[E_0 + iE_D(K)]/E_M(K)}{1 - i[E_0 + iE_D(K)]/E_q(K)} \tag{7.34} $$

in which T_0 is the dielectric relaxation time, defined by

$$ T_0 = \frac{\varepsilon \varepsilon_0}{\mu e n_0} \tag{7.35} $$

and n_0 is the zeroth order estimate of the electron density given by

no  =  Sq/O'A^D^  (7-36) 

In (7.36), τ is the free carrier lifetime, as given by

^  .  (7.37) 

A group of parameters occurs in (7.34) having the dimensions of an electric
field. This field parameter is assigned the symbol E_M(K_G), and is defined by

$$ E_M(K_G) = (\mu K_G \tau)^{-1} . \tag{7.38} $$

The physical interpretation of this field parameter can be best understood by
considering the average transport length of the mobile charges. When the drift
contribution to the current density in (7.12) dominates over the diffusion
contribution, the mobile charges during their limited lifetime τ travel an
average distance L ≈ μτE while under influence of an electric field E. The
drift transport length is equal to the grating period when the field strength
E = 2πE_M.

The space-charge field in (7.33) exponentially approaches its saturation
limit, generally with a complex time constant T(K_G) which denotes oscillatory
as well as decaying behavior. At low spatial frequencies, the drift-induced
transport length is very short compared with the grating period (equivalent to
having a field E_M large compared to the applied bias field E_0). In this
case, the decay predicted by (7.33) is governed essentially by the dielectric
relaxation time T_0 and exhibits negligible oscillatory behavior. This gives
the fastest possible response time. For higher spatial frequencies and high
applied bias fields, for which the drift-driven charge transport length
greatly exceeds the grating period, the response time increases substantially
beyond the dielectric relaxation time and in addition the response exhibits
oscillatory behavior. An intuitive interpretation of this phenomenon is that
the finite transport length blurs the charge pattern being transcribed,
forcing a longer recording time to achieve a given level of charge profile
modulation. Furthermore, the blur pattern is one-sided because the applied
bias field forces the mobile charges always in one direction, inducing a phase
shift of the charge pattern being transcribed. In all cases, the response time
is inversely proportional to the incident light intensity, so that doubling
the incident intensity reduces the response time by a factor of two.
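
A minimal sketch of how the complex time constant behaves is given below. It
assumes the form T(K) = T_0 {1 - i[E_0 + iE_D]/E_M} / {1 - i[E_0 + iE_D]/E_q}
for (7.34), and that T_0 scales inversely with the incident intensity. The
function names and the numerical field values used in the example are
hypothetical illustrations, not parameters taken from Table 7.1.

```python
def response_time_constant(t0, e0, e_d, e_m, e_q):
    """Complex time constant T(K); all fields must share the same units."""
    drive = complex(e0, e_d)  # E_0 + i E_D
    return t0 * (1 - 1j * drive / e_m) / (1 - 1j * drive / e_q)


def decay_rate_and_frequency(t_complex):
    """Split exp(-t/T) into a decay rate (real part of 1/T) and an
    oscillation angular frequency (imaginary part of 1/T)."""
    inverse = 1.0 / t_complex
    return inverse.real, inverse.imag


def scale_relaxation_time(t0_reference, intensity_ratio):
    """T_0 at a new intensity, ASSUMING T_0 is inversely proportional to it."""
    return t0_reference / intensity_ratio


if __name__ == "__main__":
    # Hypothetical fields in kV/cm: strong drift (small E_M) with a bias field.
    t = response_time_constant(t0=1.0, e0=6.0, e_d=0.5, e_m=0.6, e_q=19.0)
    rate, omega = decay_rate_and_frequency(t)
    print(f"|T|/T_0 = {abs(t):.2f}, decay rate = {rate:.3f}, omega = {omega:.3f}")
    print("T_0 after doubling the intensity:", scale_relaxation_time(1.0, 2.0))
```

With zero applied field the time constant is purely real, so the response
shows no oscillation; a bias field combined with a small E_M lengthens the
response and introduces an oscillatory component.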

By  similar  analysis,  one  finds  that  the  erasure  of  the  resulting  space-charge 
field  by  a  spatially  uniform  incident  light  beam  also  exhibits  an  exponential 
response  with  a  time  constant  that  is  inversely  proportional  to  the  erasure 
light intensity. In the absence of an applied bias field, this response is described
by  a  simple  exponential  function  with  no  shift  in  the  phase  of  the  coherent 
grating’s  charge  profile.  In  the  presence  of  an  applied  bias  field,  the  uniform 
erasure  light  beam  induces  a  drift  of  the  charge  pattern  that  was  originally 
recorded  by  the  coherent  grating  beams,  resulting  in  a  phase  modulation  as 
well  as  an  amplitude  modulation. 

Let us now review the analysis of one particular version of the simultaneous
erasure/writing mode (SEWM) for the photorefractive incoherent-to-coherent
optical conversion (PICOC) process. Consider a crystal in which a coherent
grating of wave vector K_G has been written and has reached steady state. At
time t = 0, an incoherent image grating with wave vector K_S is turned on. The
intensity incident upon the crystal is then described by

$$ I(x,t) = \begin{cases} I_0 (1 + m_G \cos K_G x) & \text{for } t < 0 \\ I_0 (1 + m_G \cos K_G x) + I_1 (1 + m_S \cos K_S x) & \text{for } t > 0 \end{cases} \tag{7.39} $$

The temporal evolution of the various components of the space-charge field can
be solved by perturbation techniques, valid for small levels of the modulation
depth m_G, with explicit expressions for the lowest order terms as presented
in Appendix 7.A. In particular, the temporal response of the E_11 component
has the general form

$$ E_{11}(t) = M_0 + M_1 \exp[-t/T_1(K_G)] + M_2 \exp[-t/T_2(K_S)] + M_3 \exp[-t/T_3(K_G + K_S)] + M_4 \exp[-t/T_4(K_G, K_S)] , \tag{7.40} $$

in which the M’s are complex coefficients that depend upon the applied field,
material parameters, and the incident light intensities, with explicit
expressions as given in Appendix 7.A and in [7.22]. Time constants T_1, T_2,
and T_3 are given by (7.34) for spatial frequencies corresponding to K_G, K_S,
and (K_G + K_S), respectively. The fourth time constant T_4 is given by

$$ T_4(K_G, K_S) = [1/T(K_G) + 1/T(K_S)]^{-1} . \tag{7.41} $$

The first term M_1 corresponds to the recording of the coherent grating wave
vector K_G in the absence of the image profile. The second term M_2
corresponds to the recording of the image profile wave vector K_S in the
absence of the coherent grating. The third term M_3 corresponds to the
recording of the intermodulation frequency K_G + K_S, and represents a
resonant response of the system of photorefractive equations. The fourth term
M_4 corresponds to the nonresonant response driven by the product of the
coherent grating and the incoherent recording processes, which arises through
the nonlinearity of the recording process. The steady state value of the E_11
field component is governed by the term M_0.
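
The structure of this response can be sketched numerically as a constant term
plus four exponential transients, with the fourth time constant obtained by
adding the rates 1/T(K_G) and 1/T(K_S). The coefficient and time constant
values in the example are hypothetical placeholders; the actual expressions
are those of Appendix 7.A and [7.22], which are not reproduced here.

```python
import cmath


def combined_time_constant(t_g, t_s):
    """Fourth time constant: the rates 1/T(K_G) and 1/T(K_S) add."""
    return 1.0 / (1.0 / t_g + 1.0 / t_s)


def e11_response(t, coefficients, time_constants):
    """Evaluate M_0 plus the sum of M_k * exp(-t / T_k) for k = 1..4.

    coefficients   = (M_0, M_1, M_2, M_3, M_4), possibly complex
    time_constants = (T_1, T_2, T_3, T_4), possibly complex
    """
    m0, *transients = coefficients
    return m0 + sum(
        m_k * cmath.exp(-t / t_k) for m_k, t_k in zip(transients, time_constants)
    )


if __name__ == "__main__":
    # Hypothetical values chosen only to show the approach to steady state.
    t1, t2, t3 = 1.0 + 0.2j, 0.8 + 0.0j, 1.1 + 0.3j
    t4 = combined_time_constant(t1, t2)
    m = (1.0, -0.4, -0.3, -0.2, -0.1)
    for t in (0.0, 1.0, 5.0, 50.0):
        print(t, abs(e11_response(t, m, (t1, t2, t3, t4))))
```

At long times every transient vanishes and only the constant term remains,
which is the sense in which M_0 sets the steady state value.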

The longest response time to reach steady state is comparable in value to the
pure coherent grating response as discussed above in (7.34-38) and in [7.19].
This overall response time obeys the following reciprocity law

in the absence of appreciable dark conductivity β in the photorefractive
crystal, in which the proportionality constant G involves only material
parameters and the applied bias field E_0. As a result, the rate at which new
information can be recorded is determined by the total available intensity
incident on the crystal.

Similar temporal response analyses have been performed for both the grating
inhibition mode (GIM) and the grating erasure mode (GEM) [7.22]. In GEM, the
response time constant of the system is inversely proportional to the
incoherent erasing intensity $I_1$, rather than the sum of the incoherent
and coherent intensities. Conversely, in GIM, the response time constant of
the system is inversely proportional to the coherent grating intensity $I_0$.

Sample perturbation analysis solutions for GEM, GIM, and SEWM are
shown in Figs. 7.20, 7.21, and 7.22, respectively, for a 1 mW/cm² average
intensity coherent grating beam at 515 nm wavelength with a grating spatial
frequency of 300 cycles/mm and a small modulation depth $m_G$, a 1 mW/cm²
average intensity incoherent image-bearing beam at 488 nm wavelength with
an image spatial frequency of 10 cycles/mm and also with a small modulation
depth $m_S$, an applied bias field of 8 kV/cm, and with the material parameters
for bismuth silicon oxide as given in Table 7.1.

Figure 7.20 shows the GEM response for three diffracted light components:
the direct recording of the unmodulated coherent grating component $I_{10}$, the
direct recording of the incoherent image-bearing beam $I_{01}$, and the
intermodulation component $I_{11}$ (the image-modulated grating component). The
coherent grating has been recorded to saturation and then turned off before the
time interval shown in the figure. The time $t = 0$ is defined when the
image-bearing beam is turned on. Thus the coherent grating frequency component
$I_{10}$ starts at its saturated level and decays for times $t > 0$ because of
erasure by the image-bearing beam. The direct recording of the image-bearing
beam $I_{01}$ grows from an initial value of zero to its saturation level. Note
that the response time for the $I_{01}$ image component, with its much lower
spatial frequency, is significantly shorter than that for the $I_{10}$ coherent
grating component, with its order of magnitude higher spatial frequency. The
image-modulated grating component $I_{11}$ exhibits a temporal response which is
derived from a combination of the $I_{10}$ and $I_{01}$ responses, eventually
evolving into a slow decay in time when the incoherent image-bearing light beam
erases the coherent grating. The $I_{11}$ intensity is eventually erased by the
image-bearing beam for very long recording times, so that the image-bearing
light exposure time must be truncated.

Fig. 7.20. Temporal evolution for the grating erasure mode (GEM) as predicted by
the perturbation series analysis. Time $t = 0$ starts after the coherent grating
has been recorded and the incoherent image-bearing beam has just been turned on.
In general the $I_{11}$ diffraction component would be much smaller than either
the $I_{10}$ or the $I_{01}$ components in this and in the next two figures,
corresponding to small modulation depths $m_G$ and $m_S$, but for simplicity all
three curves are shown as if these modulation depths were unity

Fig. 7.21. Temporal evolution for the grating inhibition mode (GIM) as predicted
by the perturbation series analysis. Time $t = 0$ starts after the incoherent
image-bearing beam has been recorded and the coherent grating beams have just
been turned on

Figure 7.21 shows a similar set of temporal response curves for the grating
inhibition mode (GIM). The image-bearing light has been recorded to saturation,
then turned off before the time interval shown in this figure. Time $t = 0$
is defined when the coherent grating light is turned on. In this mode, the pure
coherent grating component $I_{10}$ starts with zero intensity and gradually
grows to saturation, whereas the directly recorded incoherent image-bearing beam
$I_{01}$ starts from its saturation level and is quite rapidly erased by the
uniform component of the coherent grating light. The image-modulated grating
component $I_{11}$ shows initially a very rapid rise, followed by a much slower
erasure by the coherent grating beam, eventually decaying to zero for very long
recording times. In GIM, the image-modulated grating component $I_{11}$ generally
falls far short of its levels for GEM and SEWM, at least as predicted by the
perturbation series analysis, because the direct image recording is erased
before the coherent grating recording has a chance to grow.

Fig. 7.22. Temporal evolution for one version of the simultaneous erasure/writing
mode (SEWM) as predicted by the perturbation series analysis. Time $t = 0$ starts
after the coherent grating has been recorded and the incoherent image-bearing
beam has just been turned on

Fig. 7.23. Measured diffraction efficiency of the $I_{11}$ beam (strongly
modulated erasure) as a function of time for two values of the signal intensity
for the simultaneous erasure/writing mode, shown for comparison with the
corresponding theoretically predicted response curves from the perturbation
series analysis. In this figure, $I_S$ is the intensity of the signal beam(s),
$I_G$ is the intensity of the grating recording beams, and $I_R$ is the
intensity of the readout beam

Figure 7.22 presents the simultaneous erasure/writing mode response,
assuming the temporal sequencing given by (7.39). The pure coherent grating
recording $I_{10}$ starts with its initial saturation level at time $t = 0$, and
slowly drops to a reduced saturation level. The direct recording of the
incoherent image-bearing beam $I_{01}$ rapidly builds from zero intensity to
saturation. The image-modulated grating component $I_{11}$ exhibits strong
oscillations, eventually settling at its saturation level, with a response time
much longer than that of the $I_{01}$ component.

Measurements of the temporal response of the $I_{11}$ diffraction order are
compared in Fig. 7.23 with the temporal response solutions generated by the
perturbation series analysis for the cases of intensity ratios $R = 1.5$ and
$R = 5.0$ and for a grating written in the nondegenerate simultaneous
erasure/writing mode (SEWM). The experiments are shown as solid lines, the
theoretical predictions as dashed lines. In this figure, the coherent grating is
established in the saturation regime at time $t = 0$, at which point the
incoherent erasure beam is allowed to expose the crystal, as given by (7.39).
For a small $R$ ratio ($I_1 = I_S = 0.6$ mW/cm² in Fig. 7.23), the experimental
diffraction efficiency increases monotonically, at least within the time
interval of this figure. For a strong incoherent beam ($I_1 = I_S = 2$ mW/cm²
in Fig. 7.23), a transient effect appears in the experimental response within
this same time interval. Initially, the incident beam diffracts from the
composite grating at wave vector $(K_G + K_S)$ to generate a rapid rise in the
amplitude of $I_{11}$, but the strong incoherent illumination eventually erases
the coherent grating and hence the diffraction efficiency decreases to a small
steady-state value.

The  theoretical  curves  are  scaled  in  peak  intensity  and  in  dielectric  relax¬ 
ation  time  to  achieve  a  reasonable  match  with  the  experiments,  but  once  the 
scale  is  defined  for  one  curve,  it  fixes  the  scale  for  both  theoretical  curves.  The 
dielectric  relaxation  time  needed  to  achieve  agreement  between  the  theoretical 
curves and the experimental data is a factor of 3 longer than that derived from
the bismuth silicon oxide parameters given in Table 7.1 per (7.35-37). When
comparing the theoretical with the experimental curves in Fig. 7.23, it should
be noted that the theory is most accurate for low modulation depths, whereas
the  experiment  is  performed  with  the  highest  possible  modulation  depths  for 
both  the  coherent  grating  and  the  incoherent  image  beams.  Even  so,  the  match 
between  the  theoretical  response  and  the  measured  response  is  quite  striking. 

The time constant for this particular set of experimental parameters is
in the range 0.5 to 1.5 s. To achieve conversion rates of 30 frames per second
in bismuth silicon oxide, a total light intensity of order 35 to 45 mW/cm² is
extrapolated, based upon the reciprocity law given in (7.42) and assuming
that the ratio of incoherent image-bearing light to coherent grating light is
kept constant.
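The extrapolation above can be checked with a short Python sketch. It assumes
that the reciprocity law means the product of response time and total
intensity is constant. The reference points are assumptions: the grating
intensity of 0.4 mW/cm² is inferred by taking $R$ to be the ratio of signal to
grating intensity, and the pairing of each measured intensity with one end of
the 0.5 to 1.5 s range is a guess. None of these names or pairings comes from
the text.

```python
def required_intensity(i_ref, tau_ref, tau_target):
    """Total intensity needed for tau_target, assuming tau * I_total is constant."""
    return i_ref * tau_ref / tau_target


TAU_TARGET = 1.0 / 30.0  # s, one frame period at 30 frames per second

# Assumed reference points (total intensity in mW/cm^2, time constant in s).
# I_G = 0.4 mW/cm^2 is inferred from R = I_S / I_G, which is itself an assumption.
reference_points = [
    (0.4 + 0.6, 1.5),  # weak signal beam paired with the slow end of the range
    (0.4 + 2.0, 0.5),  # strong signal beam paired with the fast end of the range
]

estimates = [required_intensity(i, tau, TAU_TARGET) for i, tau in reference_points]
for value in estimates:
    print(f"{value:.1f} mW/cm^2")
```

Under these assumptions both estimates fall in or near the 35 to 45 mW/cm²
range quoted in the text.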


7.5 The Readout Process

The readout process consists of the optical modulation of the coherent readout
beam by the space-charge field. This modulation occurs in PICOC through the
linear electrooptic effect which modifies the refractive index within the
photorefractive crystal, thus establishing a volume phase grating. The phase
hologram is then read out by a coherent auxiliary beam to achieve the conversion.

The diffraction characteristics of such volume phase gratings have been
studied using many different analytical techniques, including coupled wave
analysis by Kogelnik [7.31] with extensive numerical studies by Klein and Cook
[7.32], a Born approximation to a scattering integral by Gordon [7.33], and
the optical beam propagation method by Yevick and Thylén [7.34] and Johnson
and Tanguay [7.35]. Methods for studying the polarization properties of
light diffraction in electrooptic crystals such as bismuth silicon oxide include
anisotropic versions of the coupled wave formalism [7.23, 36] and the optical
beam propagation method [7.35].

This section examines the readout of the phase gratings and its consequences
for the performance of PICOC as a spatial light modulator. Because of
the high spatial frequencies typically used in PICOC, the grating exhibits
pronounced Bragg diffraction characteristics with rapid degradation of the
readout quality whenever the Bragg condition is detuned, whether by increasing
the spatial frequency of the incoherent image, by slight angular misalignment
of the readout beam, or by sub-optimum alignment of the incoherent image beam.
The Bragg sensitivity is discussed in the section on the isotropic phase
grating properties. In addition, because of the optical activity exhibited by
bismuth silicon oxide, the polarization states of both the transmitted probe
light and the diffracted signal light will in general be elliptical, with
implications for the optimum readout conditions, as discussed in the section on
polarization properties.


7.5.1  Isotropic  Phase  Grating  Model 

The space-charge field induces a small perturbation in the index of refraction
through the linear electrooptic effect, such that

$$\Delta n(x) = \tfrac{1}{2} n_0^3 r_{41} E_1 \cos(K_G x) \tag{7.43}$$

in which $n_0$ is the nominal index of refraction and $r_{41}$ is the
electrooptic coefficient appropriate for bismuth silicon oxide. The index
perturbation in turn modulates the optical phase fronts of an incident light
beam. Thus the sinusoidal space-charge field induces a volume phase grating in
the crystal. To gain some feeling for the readout performance issues involved in
PICOC, consider a 2 mm thick piece of bismuth silicon oxide with a simple
unmodulated sinusoidal grating with spatial frequency of 300 cycles/mm,
space-charge field of 5 kV/cm, and probed by a 633 nm laser beam. A space-charge
field $E_1$ of order 5 kV/cm induces an index perturbation $\Delta n$ of order
$2 \times 10^{-5}$, assuming the material parameters for bismuth silicon oxide
listed in Table 7.1.
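The order-of-magnitude estimate for the index perturbation can be reproduced
with a few lines of Python. Table 7.1 is not reproduced in this section, so the
refractive index and electrooptic coefficient below are assumed, commonly quoted
values for bismuth silicon oxide rather than the table entries; the function
name is likewise only illustrative.

```python
def index_perturbation(n0, r41, e1):
    """Peak index change from the linear electrooptic effect: 0.5 * n0^3 * r41 * E1."""
    return 0.5 * n0**3 * r41 * e1


N0 = 2.54      # assumed refractive index near 633 nm (not from Table 7.1)
R41 = 5.0e-12  # assumed electrooptic coefficient in m/V (not from Table 7.1)
E1 = 5.0e5     # space-charge field in V/m, i.e. 5 kV/cm

delta_n = index_perturbation(N0, R41, E1)
print(f"index perturbation: {delta_n:.2e}")
```

With these assumed parameters the result is close to the value of order
$2 \times 10^{-5}$ stated in the text.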

Dimensionless parameters that characterize the diffraction characteristics
of such a phase grating include the grating thickness parameter $Q$ and the
grating strength $v$, defined by

$$Q = \frac{K_G^2 \lambda_r d}{2\pi n_0} , \tag{7.44}$$

$$v = \frac{2\pi \Delta n\, d}{\lambda_r} , \tag{7.45}$$

in which $\lambda_r$ is the wavelength of the readout light in vacuo and $d$ is
the thickness of the grating [7.32]. For a 300 cycles/mm grating in a 2 mm thick
crystal, read out by a 633 nm laser beam, the associated thickness parameter $Q$
is almost 300, which is considered to be a very thick grating [7.30]. A
space-charge field of the order of 5 kV/cm induces a grating strength of the
order of 0.35 radians, which gives fairly weak diffraction efficiency (on the
order of a few per cent). The combination of large thickness parameter $Q$ and
weak grating strength $v$ places the grating deep in the Bragg regime with an
optical diffraction efficiency $\eta$ given approximately by

$$\eta = [(v/2\sigma) \sin\sigma]^2 , \quad \text{with} \tag{7.46}$$

$$\sigma = [\xi^2 + (v/2)^2]^{1/2} , \tag{7.47}$$

$$\xi = \tfrac{1}{2} K_G d \sin(\theta_{in} - \theta_B) \tag{7.48}$$

(according to [Ref. 7.31, Eqs. 17, 42, and 43] or [Ref. 7.32, Eqs. 6-8, 35, and
36]), in which $\theta_{in}$ is the entrance angle of the incident probe light
measured with respect to the constant phase lines of the coherent grating, and
$\theta_B(K_G)$ is the Bragg angle associated with wave vector $K_G$, defined by

$$\sin\theta_B = \frac{\lambda_r K_G}{4\pi n_0} . \tag{7.49}$$

The parameter $\xi$ in (7.47, 48) is a measure of the Bragg misalignment; it is
equal to zero when the readout beam is perfectly Bragg aligned with respect
to the volume hologram.

For perfect Bragg alignment in which $\theta_{in} = \theta_B$, and assuming a
grating strength $v \simeq 0.35$ radians, the peak diffraction efficiency is
estimated to be of order 3 %. Doubling the thickness of the grating would
increase the diffraction efficiency to 12 % (ignoring polarization issues), but
also increases the Bragg detuning sensitivity, as described in the following
paragraphs.
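The quoted values of the thickness parameter, grating strength, and peak
efficiency can be checked numerically. The sketch below assumes the Klein and
Cook style definitions $Q = K_G^2 \lambda_r d / (2\pi n_0)$ and
$v = 2\pi \Delta n\, d / \lambda_r$, and a peak efficiency of $\sin^2(v/2)$ at
perfect Bragg alignment. The refractive index is an assumed value, and the
function names are illustrative only.

```python
import math

N0 = 2.54                  # assumed refractive index (Table 7.1 not reproduced here)
LAMBDA_R = 633e-9          # readout wavelength in vacuum, m
D = 2e-3                   # crystal thickness, m
K_G = 2 * math.pi * 300e3  # grating wave number for 300 cycles/mm, rad/m


def thickness_parameter(k_g, lam, d, n0):
    """Grating thickness parameter Q (assumed Klein-Cook form)."""
    return k_g**2 * lam * d / (2 * math.pi * n0)


def grating_strength(delta_n, lam, d):
    """Grating strength v in radians (assumed form, no obliquity factor)."""
    return 2 * math.pi * delta_n * d / lam


def peak_efficiency(v):
    """Bragg-regime efficiency at zero mismatch, sin^2(v/2)."""
    return math.sin(v / 2) ** 2


q = thickness_parameter(K_G, LAMBDA_R, D, N0)
v = grating_strength(2e-5, LAMBDA_R, D)
print(f"Q = {q:.0f}, v = {v:.2f} rad")
print(f"peak efficiency at v = 0.35: {100 * peak_efficiency(0.35):.1f} %")
print(f"peak efficiency at v = 0.70: {100 * peak_efficiency(0.70):.1f} %")
```

With the rounded index perturbation of $2 \times 10^{-5}$ the computed grating
strength comes out near 0.4 radians, the same order as the 0.35 radians used in
the text; the efficiencies are evaluated at the text's value of 0.35 and at
twice that value to mimic a doubled thickness.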

Bragg detuning impacts PICOC performance in two ways: the angular
alignment sensitivity of the hologram to the coherent readout beam, and the
spatial frequency response of the hologram readout with its concomitant
angular alignment dependence on the incoherent image-bearing beam. Consider
first the alignment sensitivity to the coherent readout beam. An angular
misalignment $\Delta\theta$ from the optimum Bragg alignment introduces a Bragg
mismatch $\xi$ of

$$\xi \simeq \tfrac{1}{2} K_G d\, \Delta\theta , \tag{7.50}$$

as seen from (7.48). Assuming a grating strength $v \simeq 0.35$ radians, one
finds from (7.46, 47) that the angular alignment sensitivity of the diffraction
efficiency has essentially a sinc$^2$ profile. The angular misalignment
$\Delta\theta$ needed to reach the first null of this profile occurs when
$\xi \simeq \pi$, i.e., for

$$\Delta\theta \simeq \frac{2\pi}{K_G d} . \tag{7.51}$$

Hence the angular alignment needed to achieve optimum diffraction efficiency
is extremely sensitive, on the order of 0.1° for a 2 mm thick photorefractive
crystal with a 300 cycles/mm grating frequency.
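The angular selectivity can be illustrated with a short Python sketch. It
assumes the standard Bragg-regime efficiency
$\eta = [(v/2\sigma)\sin\sigma]^2$ with $\sigma = (\xi^2 + v^2/4)^{1/2}$ and
the first-order mismatch $\xi = K_G d\, \Delta\theta / 2$; the function names
are illustrative and not from the text.

```python
import math


def efficiency(v, xi):
    """Bragg-regime diffraction efficiency with sigma = sqrt(xi^2 + (v/2)^2)."""
    sigma = math.hypot(xi, v / 2)
    return (v / (2 * sigma) * math.sin(sigma)) ** 2


def mismatch(k_g, d, dtheta):
    """First-order Bragg mismatch for an angular detuning dtheta (radians)."""
    return 0.5 * k_g * d * dtheta


K_G = 2 * math.pi * 300e3  # 300 cycles/mm grating, rad/m
D = 2e-3                   # crystal thickness, m
V = 0.35                   # grating strength used in the text, rad

# Detuning at which xi = pi, i.e. approximately the first null of the profile.
first_null = 2 * math.pi / (K_G * D)
print(f"first null at about {math.degrees(first_null):.3f} degrees")

# Normalized efficiency at a few fractions of the first-null detuning.
for fraction in (0.0, 0.25, 0.5, 0.75, 1.0):
    xi = mismatch(K_G, D, fraction * first_null)
    print(fraction, efficiency(V, xi) / efficiency(V, 0.0))
```

Because the grating strength is small, the normalized profile is nearly
indistinguishable from a sinc-squared curve, and the null position depends only
on the product of grating wave number and thickness.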

One possible alignment of the PICOC system is to orient the readout beam
to be Bragg matched precisely to the coherent grating, thereby maximizing the
intensity of the $I_{10}$ diffracted order. However, when an incoherent grating
is also incident upon the crystal, the nonlinear recording process creates a new
grating with wave vector $(K_G + K_S)$ that in general does not satisfy the same
Bragg condition as the coherent grating wave vector $K_G$. The resulting
$I_{11}$ intensity component is then attenuated by an amount dependent upon the
image wave vector $K_S = 2\pi f_S$, in which $f_S$ is the spatial frequency of
the image profile. The dependence of the attenuation on the image frequency
$f_S$ is a function of the orientation of the image-bearing beam with respect to
the coherent grating.

The importance of the image-bearing beam orientation on the spatial
frequency response of the converter is strikingly illustrated by Fig. 7.24, in
which are shown converted images of two orthogonal orientations of a
5 line pair/mm Ronchi ruling and the associated coherent Fourier transforms. As
can be seen from the figure, a significant difference in resolution exists
between cases in which the wave vector of the ruling (incoherent grating) is
parallel or perpendicular to the coherently written (holographic) grating. This
difference derives principally from the fact that a different wave vector
matching condition exists for these two cases.

Consider the alternative wave vector matching conditions shown in Figs.
7.25 and 7.26. In Fig. 7.25, the incoherent grating wave vector $K_S$ is
parallel to the coherent grating wave vector $K_G$, a condition achieved by
symmetrically disposing the incident coherent beams about the normal to the
crystal while simultaneously arranging the incoherent imaging system such that
its optical axis is parallel to the crystal normal. In this case, significant
Bragg detuning occurs for even small incoherent grating wave vectors.


Fig. 7.24. Photograph of the Fourier transform of a Ronchi ruling recorded in
the PICOC configuration, showing a strong anisotropy in the resolution
performance when the image-bearing light beam bisects the two coherent grating
writing beams. Panel labels: Ronchi ruling; Fourier transform; 5 lp/mm

Fig. 7.25. Wave vector mismatch diagram for the case in which the image-bearing
beam bisects the two coherent grating writing beams

Let the spatial bandwidth of the readout process be defined as that spatial
frequency $f_S$ for which the magnitude of the $I_{11}$ diffracted intensity
component degrades to 25 % of its peak value. This occurs when
$\xi \simeq 1.9$, as determined from (7.46, 47), assuming a small grating
strength $v$ on the order of 0.35 radians. The Bragg mismatch parameter $\xi$ is
a function of the image spatial frequency $f_S$, and can be approximated by the
first term or two in a Taylor series expansion of the form

$$\xi(f_S) = \xi_0 + f_S \frac{d\xi}{df_S} + \frac{1}{2} f_S^2 \frac{d^2\xi}{df_S^2} + \text{higher order terms} \tag{7.52}$$

in which $\xi_0$ represents the Bragg mismatch associated with the coherent
grating. Standard alignment procedure is to set this zeroth order mismatch term
$\xi_0$ to zero, i.e., to Bragg match the incident readout beam to the coherent
grating.

For the case of the image-bearing beam bisecting the two coherent grating
beams, as shown in Fig. 7.25, the first order term in (7.52) is a sufficiently
accurate estimate of the Bragg mismatch parameter and this term can be
derived from (7.48) to give

$$\frac{d\xi}{df_S} = \frac{1}{2} K_G d\, \frac{d\theta_B}{df_S} . \tag{7.53}$$

By (7.49), the term $d\theta_B/df_S$ is equivalent to

$$\frac{d\theta_B}{df_S} = \frac{\lambda_r}{2 n_0 \cos\theta_B} . \tag{7.54}$$

Hence the spatial bandwidth $f_S$ is

$$f_S = \frac{7.6\, n_0 \cos\theta_B}{\lambda_r K_G d} , \tag{7.55}$$

which can be expressed in terms of the fringe spacing $\Lambda_G$ of the coherent
grating as

$$f_S = \frac{1.2\, n_0 \Lambda_G \cos\theta_B}{\lambda_r d} . \tag{7.56}$$

Fig. 7.26. Optimum wave vector alignment diagram in which the image-bearing
light is tilted with respect to the bisector of the two coherent grating writing
beams by the Bragg angle associated with the probe beam

For the parameters used in our experiments, this bandwidth is estimated to
be on the order of 8 cycles/mm. Note that doubling the crystal thickness $d$
to increase the diffraction efficiency by almost a factor of four has the adverse
effect of reducing the spatial bandwidth $f_S$ by a factor of two for this
particular alignment configuration.
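The bandwidth estimate for the bisecting geometry can be reproduced with a
short Python sketch. It first solves $(\sin\xi/\xi)^2 = 0.25$ by bisection,
which is the small grating strength limit of the efficiency profile, and then
applies the first-order relation $f_S = 4\xi\, n_0 \cos\theta_B /
(\lambda_r K_G d)$. The refractive index is an assumed value and the function
names are illustrative.

```python
import math


def xi_at_fraction(fraction=0.25):
    """Solve (sin(xi)/xi)^2 = fraction on (0, pi) by bisection (small-v limit)."""
    lo, hi = 1e-9, math.pi
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if (math.sin(mid) / mid) ** 2 > fraction:
            lo = mid  # still above the target level, move right
        else:
            hi = mid
    return 0.5 * (lo + hi)


def bandwidth_bisecting(xi_c, n0, lam, k_g, d):
    """Spatial bandwidth in cycles/m from the first-order mismatch term."""
    cos_b = math.sqrt(1.0 - (lam * k_g / (4 * math.pi * n0)) ** 2)
    return 4 * xi_c * n0 * cos_b / (lam * k_g * d)


N0 = 2.54                  # assumed refractive index (not from Table 7.1)
LAMBDA_R = 633e-9          # readout wavelength in vacuum, m
K_G = 2 * math.pi * 300e3  # 300 cycles/mm grating, rad/m

xi_c = xi_at_fraction(0.25)
f_2mm = bandwidth_bisecting(xi_c, N0, LAMBDA_R, K_G, 2e-3)
f_4mm = bandwidth_bisecting(xi_c, N0, LAMBDA_R, K_G, 4e-3)
print(f"xi at 25 % level: {xi_c:.3f}")
print(f"bandwidth, 2 mm crystal: {f_2mm / 1e3:.1f} cycles/mm")
print(f"bandwidth, 4 mm crystal: {f_4mm / 1e3:.1f} cycles/mm")
```

The factor of four times the critical mismatch corresponds to the numerical
coefficient of about 7.6 in the bandwidth expression, and the inverse
dependence on thickness shows directly why doubling the crystal thickness
halves the bandwidth in this geometry.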

Compare now the resolution performance associated with the alignment of
Fig. 7.25 with that for Fig. 7.26. In Fig. 7.26, the incoherent grating wave
vector is arranged to lie tangentially to the circle defined by the readout beam
wave vector, such that a significantly increased angular deviation of the
diffracted beam is allowed before serious Bragg detuning effects occur. Such a
wave vector tangency condition is automatically satisfied when the incoherent
image wave vector is normal to the coherent grating wave vector (as it is in the
$y$ orientation normal to the plane of incidence). Alternatively, the wave
vector tangency condition is satisfied when the central ray of the incoherent
image beam is parallel to the diffracted probe light, for which the horizontal
and vertical resolutions become degenerate. This is not the case for the
situation depicted in Fig. 7.25, which explains the observations apparent in
Fig. 7.24. An equivalent condition has been described by Kamshilin and Petrov
[7.6] for the nondegenerate four-wave mixing optical architecture.

In the geometry of Fig. 7.26, the addition of an image frequency $f_S$ shifts
the pointing direction of the combined wave vector $(K_G + K_S)$ such that the
incident readout light remains almost perfectly Bragg matched over a much
broader range of image frequencies. Mathematically, this condition is equivalent
to setting the first order term of the Taylor series expansion in (7.52) for
the Bragg mismatch parameter $\xi$ equal to zero. To demonstrate this, consider
the more general case in which the image profile wave vector $K_S$ is oriented
at a small angle $\theta_S$ with respect to the coherent grating wave vector
$K_G$. The combination $(K_G + K_S)$ rotates through a small angle
$\Delta\theta$ compared with wave vector $K_G$ alone, with $\Delta\theta$ given
approximately by

$$\Delta\theta \simeq \frac{K_S}{K_G}\, \theta_S . \tag{7.57}$$

The Bragg mismatch parameter $\xi$ induced by the image wave vector $K_S$ is
to first order given by

$$\xi \simeq \frac{1}{2} K_G d \left( f_S \frac{d\theta_B}{df_S} - \Delta\theta \right) , \tag{7.58}$$

and these two terms cancel when the skew angle $\theta_S$ equals $\theta_B$, the
Bragg angle for the readout light beam associated with the coherent grating
frequency. For a 633 nm readout beam wavelength and a 300 cycles/mm grating
frequency, the Bragg angle $\theta_B$ is of order 5°, implying that a mere 5°
separates the optimum resolution alignment of Fig. 7.26 from the alignment of
Fig. 7.25. In addition, the angular alignment tolerance on the incoherent
image-bearing beam is much tighter than the nominal alignment angle of 5°,
typically of order 0.7°.

With the alignment of Fig. 7.26, the degradation in diffraction efficiency
then becomes a second order function of the image spatial frequency $f_S$. The
second derivative term in (7.52) is

$$\frac{d^2\xi}{df_S^2} = \frac{\pi \lambda_r d}{n_0}$$

and the resulting spatial bandwidth of the tangential configuration is

$$f_S = \left( \frac{1.2\, n_0}{\lambda_r d} \right)^{1/2} . \tag{7.59}$$

This expression was first published by Kamshilin and Petrov [7.6]. For our
experimental parameters, a tangential geometry increases the converter's
bandwidth from 8 to 48 cycles/mm. It is interesting to note that, in addition to
the increased bandwidth of the tangential geometry, the spatial resolution is
independent of the coherent grating frequency, and that doubling the thickness
$d$ of the crystal does not halve the converter's bandwidth, as it would for the
alignment configuration shown in Fig. 7.25, but only reduces it by a factor of
$\sqrt{2}$.
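The contrast between the two alignments can also be explored with a simple
wave-vector model in Python. This is a sketch under stated assumptions and not
the derivation used in the text: the readout beam is Bragg matched to the
coherent grating alone, the mismatch is taken as $\xi = d\,(|k_d|^2 - k^2)/(4k)$
for a diffracted wave vector $k_d$ inside the crystal, the bandwidth is the
image frequency at which $|\xi|$ reaches 1.9, and the refractive index is an
assumed value. In the sign convention chosen here, the tangential geometry
corresponds to a skew angle of minus the internal Bragg angle.

```python
import math

N0 = 2.54          # assumed refractive index (not from Table 7.1)
LAMBDA_R = 633e-9  # readout wavelength in vacuum, m
F_G = 300e3        # coherent grating frequency, cycles/m

K = 2 * math.pi * N0 / LAMBDA_R     # readout wave number inside the crystal
K_G = 2 * math.pi * F_G             # coherent grating wave number
THETA_B = math.asin(K_G / (2 * K))  # internal Bragg angle for K_G alone


def mismatch(f_s, theta_s, d):
    """Bragg mismatch for image frequency f_s (cycles/m) at skew angle theta_s."""
    k_s = 2 * math.pi * f_s
    # Incident wave vector plus grating vector K_G (along x) plus image vector K_S.
    kx = -K * math.sin(THETA_B) + K_G + k_s * math.cos(theta_s)
    kz = K * math.cos(THETA_B) + k_s * math.sin(theta_s)
    return 0.5 * d * (kx * kx + kz * kz - K * K) / (2 * K)


def bandwidth(theta_s, d, xi_limit=1.9, f_max=200e3):
    """Image frequency (cycles/m) at which |xi| reaches xi_limit, by bisection."""
    lo, hi = 0.0, f_max
    for _ in range(80):
        mid = 0.5 * (lo + hi)
        if abs(mismatch(mid, theta_s, d)) < xi_limit:
            lo = mid
        else:
            hi = mid
    return lo


for label, theta_s in (("bisecting", 0.0), ("tangential", -THETA_B)):
    for d in (2e-3, 4e-3):
        print(label, f"d = {d * 1e3:.0f} mm:",
              f"{bandwidth(theta_s, d) / 1e3:.1f} cycles/mm")
```

In this model the first-order mismatch term vanishes exactly in the tangential
case, so the bandwidth scales as the inverse square root of the thickness and
does not depend on the grating frequency, whereas the bisecting case scales
almost inversely with thickness.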

Further experimental tests of the Bragg detuning hypotheses are presented
in Figs. 7.27-29, in which the image source was a Michelson interferometer
(shown in Fig. 7.15) to alleviate the depth of focus issues discussed previously.
In these experiments, the intensity of the diffracted component $I_{11}$ was
measured as a function of the spatial frequency of the image source grating.
Figure 7.27 shows the response for the geometry in which the incoherent image
beam bisects the coherent grating writing beams to achieve the wave vector
mismatch condition diagrammed in Fig. 7.25. The predicted frequency response
rolloff with bandwidth of 8 cycles/mm, shown by the solid line, compares
reasonably with the experimental data points which indicate a bandwidth of
6 cycles/mm. Figure 7.28 shows the rolloff when the signal beam wave vector
is tangent to the equiphase circle as shown in Fig. 7.26, giving much improved
frequency response of 45 cycles/mm which is in excellent agreement with the
theoretically predicted 48 cycles/mm. Figure 7.29 shows the rolloff when the
signal light wave vector is nominally aligned to be perpendicular to the applied
bias field and to the coherent grating wave vector. The theoretical bandwidth
for this configuration should be identical to that shown in Fig. 7.28, but the
measurements indicate a bandwidth of only 25 cycles/mm. This discrepancy
may be attributable to the very high alignment sensitivity of the incoherent
image beam relative to the plane defined by the two coherent writing beams,
e.g., a deviation of approximately 0.7° would cause a deterioration of the
bandwidth from 48 cycles/mm to 25 cycles/mm.

Fig. 7.27. Experimental measurement of the diffraction efficiency as a function
of image spatial frequency associated with the alignment shown in Fig. 7.25

Fig. 7.28. Experimental measurement of the diffraction efficiency as a function
of image spatial frequency associated with the optimum alignment shown in
Fig. 7.26

Fig. 7.29. Experimental measurement of the diffraction efficiency as a function
of image spatial frequency for the alignment in which the image-bearing beam's
wave vector is normal to the applied bias electric field

To summarize, the presence of the coherent grating in PICOC creates a
volume hologram that is typically operated quite deep into the Bragg regime,
with very strong misalignment sensitivities. As a consequence, the readout
beam must be aligned typically within 0.1° of optimum, the image-bearing
beam must be aligned typically within 0.7° of its optimum, and the optimum
alignment for the image-bearing beam is not to bisect the two coherent grating
beams (Fig. 7.25), but rather offset from this by a small angle typically of order
5° (as shown in Fig. 7.26).

7.5.2 Polarization Issues

Bismuth silicon oxide has quite remarkable optical polarization properties that
strongly influence the optimization of the readout process [7.23]. These
properties include significant levels of natural optical activity (as high as
46°/mm for the 488 nm argon ion laser wavelength; see Fig. 7.30), a uniform
linear birefringence induced by the applied bias field through the electrooptic
effect, and a spatially varying linear birefringence induced by the
image-defined space-charge field. In almost all readout configurations, these
properties will cause both the readout and the diffracted signal beams to
exhibit elliptical polarizations.

Consider Fig. 7.31, in which is shown the evolution of the polarization
states for both the readout and the diffracted signal beams as a function of
depth into the bismuth silicon oxide crystal for a 633 nm wavelength readout
beam and an applied bias field of 6 kV/cm in the absence of self-diffraction
effects. A hologram induced by a single coherent grating unmodulated by any
image profile is assumed. In this figure, we have chosen to plot the polarization
states that result from an input polarization set at 45° with respect to the
grating wave vector, which is therefore along one of the two electrooptically
induced principal axes of the crystal's index ellipsoid. As such, the
polarizations of the readout and signal beams are nearly parallel for very
shallow depths but quickly evolve toward a 90° major axis separation with
increasing depth because of the influence of the optical activity. This
separation of polarization states enables the use of polarization analyzer
techniques to suppress the scattered readout beam light in favor of the
diffracted signal light. The intricacy of the polarization state evolution can
be appreciated from Fig. 7.31.

Fig. 7.30. Optical rotatory power of bismuth silicon oxide as a function of the
wavelength of the readout light (after [7.37])

One  potential  application  of  the  polarization  properties  in  bismuth  silicon 
oxide  has  been  demonstrated  by  Herriau  et  al.  [7.15]  for  obtaining  optimum 
holographic  readout.  They  attained  excellent  suppression  of  scattered  readout 
light  noise  for  a  nearly  on-axis  recording  configuration  by  placing  a  polarization 
analyzer  into  the  diffracted  signal  beam  path.  The  analyzer  is  then  adjusted 
to eliminate the scattered readout beam, and since the diffracted signal beam
generally has a different polarization state from that of the transmitted readout
beam, a significant fraction of the signal beam will pass through the analyzer.
This  technique  greatly  improves  the  signal-to-noise  ratio  of  the  holographic 
reconstruction  process. 

The presence of spatial modulation further complicates the polarization
properties of the diffracted light, especially when the incoherent image beam
is misaligned from the optimum wave vector matching configuration shown in
Fig. 7.26, which introduces a strong polarization state dispersion. That is, the
polarization states of the modulated readout beam's diffraction orders ($I_{mn}$
in Fig. 7.10) are strongly dependent on the image spatial frequency when the
optimum alignment of Fig. 7.26 is not achieved. Such dispersion further
degrades the resolution if a polarization analyzer is used to separate the
scattered readout light from the diffracted signal light. In contrast, when the
optimum alignment of Fig. 7.26 is met, the polarization dispersion is
negligible. This is one more reason why the alignment of Fig. 7.26 is crucial to
achieving the best performance from the PICOC device.

The  most  serious  issue  concerning  the  polarization  properties  of  volume 
holograms  in  bismuth  silicon  oxide  is  the  degradation  of  the  light  diffraction 
efficiency  that  is  imposed  by  the  optical  rotatory  power  [7.23].  However,  one 
readout configuration has been identified in extensive polarization analyses
that does not degrade the diffraction efficiency, namely the crystal geometry
of  Fig.  7.8  with  no  applied  bias  field  and  with  a  circularly  polarized  readout 
light  beam.  The  diffraction  efficiency  for  a  linearly  polarized  readout  beam  is 
compared  with  that  of  a  circularly  polarized  readout  beam  in  Fig.  7.32,  showing 
the  marked  improvement  in  diffraction  efficiency  that  can  be  achieved  by  using 
circularly  polarized  light  at  the  correct  alignment. 

In conclusion, the polarization properties of light diffraction in bismuth
silicon oxide significantly affect the performance of the PICOC modulator, and
can be exploited to improve this performance in terms of signal-to-noise ratio
and to attain the highest possible diffraction efficiency.


Fig. 7.32. Comparison of the
diffraction efficiency achieved by
linearly polarised and circularly
polarised probe light when the
applied bias field is set to zero
during the readout process


7.6  Conclusion 

Through the use of the photorefractive incoherent-to-coherent optical
conversion (PICOC) process, we have successfully converted incoherent images into
their negative coherent replicas. The PICOC system is inexpensive, easily
implementable, and compares favorably in its performance with other
photorefractive spatial light modulators. In addition, we have developed a mathematical
framework within which to analyze the performance of the converter. Using this
formalism, we have derived the conditions necessary to achieve high linearity,
good contrast ratio, and fast temporal response. Experiments have been
conducted which verified the essential features of these predictions, and which have
underscored important materials issues. It is of considerable importance to
extend the model to account more accurately for these materials issues, to predict
the response of the converter at higher modulation depths, and to explore the
optical phase modulation induced in the output image and its impact on
various image processing architectures. In addition, testing of the photorefractive
incoherent-to-coherent optical converter in a representative optical image
processing system will undoubtedly highlight significant features worthy of further
study and advanced development.


7.A Appendix. Steady State and Temporal Behavior
of the Space-Charge Field Components in PICOC
(Simultaneous Erasure/Writing Mode)

In this Appendix, the expressions for the lowest order components of the
space-charge field $E_{mn}$ are presented, both in the steady state and with full temporal
evolution. Details of the derivation of these expressions can be found in [7.22].

7.A.1  Steady  State  Behavior 

If  the  light  incident  on  the  photorefractive  crystal  is  given  by 


$$I(x) = I_0(1 + m_G \cos K_G x) + I_1(1 + m_S \cos K_S x) \tag{7.A1}$$

then to first order the steady state responses of the $E_{01}$, $E_{10}$, and $E_{11}$
components are


ElO  -  -2"*Q  d(Kg) 

(7.A2) 

-  -  jmg  ™ 

(7.A3) 

-  imfmf[I?(Ka)V(Ks)D(JCa  + 

X  {Fi[£5#-*-i£D('Ka)l  +  -^2(^  +  >^D(^s)l} 

(7.A4) 

in  which 

_.<r ..  _  •Sq/o 

^  ^5q/o  +  Ss/i 

(7.A5) 

_•»  _  Ssli 

^  ^5a/o  +  5s/i 

(7.A6) 
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D(K)-l 

(7.A7) 

(7.A8) 

(7.A9) 

(7.A10) 

(7.A11) 

for which $K$ can assume the values $K_G$, $K_S$, and $(K_G + K_S)$.


7.A.2 Temporal Response

The temporal behavior of the space-charge field components for the
simultaneous erasure/writing mode (SEWM) is discussed in this section. Consider the
following sequencing of light intensity profiles:

$$I(x) = \begin{cases} I_0(1 + m_G \cos K_G x) & \text{for } t < 0 \\ I_0(1 + m_G \cos K_G x) + I_1(1 + m_S \cos K_S x) & \text{for } t > 0 \end{cases} \tag{7.A12}$$

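The temporal evolution of the various components of the space-charge field is
given by

$$E_{10} = -\left\{(m_G - m_G^{\mathrm{eff}})\exp[-t/\tau(K_G)] + m_G^{\mathrm{eff}}\right\}[E_0 + iE_D(K_G)]\,D^{-1}(K_G) \tag{7.A13}$$

$$E_{01} = -m_S^{\mathrm{eff}}\left\{1 - \exp[-t/\tau(K_S)]\right\}[E_0 + iE_D(K_S)]\,D^{-1}(K_S) \tag{7.A14}$$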
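
The single-exponential form of (7.A13) can be explored numerically with a short sketch. It is only an illustration under stated assumptions: the effective modulation depth, the time constant $\tau(K_G)$, and the complex denominator $D(K_G)$ are passed in as plain numbers because their defining expressions are not reproduced here, and the function name and all sample values are hypothetical.

```python
import math


def e10_component(t, m_g, m_g_eff, tau, e0, e_d, d_factor):
    """Evaluate the E10 space-charge field component for t >= 0.

    Assumed form (after Eq. 7.A13):
        E10 = -[(m_g - m_g_eff) * exp(-t / tau) + m_g_eff] * (E0 + i*E_D) / D
    `d_factor` stands for the complex quantity D(K_G), supplied by the caller.
    """
    if t < 0:
        raise ValueError("this expression applies only for t >= 0")
    envelope = (m_g - m_g_eff) * math.exp(-t / tau) + m_g_eff
    return -envelope * complex(e0, e_d) / d_factor


if __name__ == "__main__":
    # Hypothetical parameter values, chosen only to show the trend.
    params = dict(m_g=1.0, m_g_eff=0.4, tau=1.0, e0=0.0, e_d=1.0e5,
                  d_factor=complex(1.2, 0.0))
    for t in (0.0, 1.0, 5.0):
        print(t, abs(e10_component(t, **params)))
```

In this sketch the grating component starts at the value set by the initial modulation depth and relaxes toward the value set by the effective modulation depth once the incoherent beam is switched on.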


En  *  Mo  +  Mie-‘/^‘  Mje*'/^*  -t-  Mje”*/^*  (7.A15) 

in  which 

Mo  -  \rrt:§wf[D{KG)D{Ks)D{KG  +  KsT^ 

X  {^l['Eo+i£D(^G)I  +  'F*3(j^  +  >'®D(^s))}  (7.A16) 


M, 


j(m<3  -  m^)mf[D{KG)D{Ks)D{KG  +  ^s)]’" 

X  iEG(KG)]  +  F2[E9  +  i^DC^fs)] (1  -  ^)  } 


(7.A17)
.^2  =  -  +  A:s)]“^5r^

X  f^Fi[Eo-^-iEoiKG)](l-’^^+F2[Eo  +  iEo(Ks)]^  (7.A18) 


M4  =  -  i(mG  -  m*^)mf[D{KG)D{Ks)D{KG  +  Ks)]' 

.1  n 

T4-T3 

X  |F,(£o  +  iED(^G)](l-^) 

+  F2{£o  +  ii:D(/fs)l(l-“)} 

(7.A19) 

$$M_3 = -(M_0 + M_1 + M_2 + M_4) \tag{7.A20}$$

The dielectric relaxation time constant $\tau_0$ is defined by

$$\tau_0 = \frac{\epsilon\epsilon_0}{\mu e n_0} \tag{7.A21}$$

the time constants $\tau_1$, $\tau_2$, and $\tau_3$ are defined by

$$\tau(K) = \tau_0\,\frac{C(K)}{D(K)} \tag{7.A22}$$

for $K = K_G$, $K_S$, and $(K_G + K_S)$ respectively; and the time constant $\tau_4$ is
defined by

(7.A23) 

In (7.A22), the factor $C(K)$ is defined by

(7.A24) 

Note that the GEM temporal response can be derived from the SEWM
expression by setting the average coherent grating intensity $I_0 = 0$ for $t > 0$.
Thus the $M_0$ and $M_2$ terms disappear for GEM, leaving the $M_1$, $M_4$, and
$M_3 = -(M_1 + M_4)$ terms. The GIM response can in turn be derived from the
GEM response.


Introduction

A number of inherent inefficiencies exist in the photorefractive recording of
grating structures due to the nature of the photoexcitation and charge
transport processes. These inefficiencies can be quantified by postulating highly
idealized photoexcitation and charge transport models that yield optimum
(quantum limited) space charge field distributions, assuming a
photoexcitation constraint of no more than one mobile charge per incident photon.
By comparing such highly idealized photorefractive recording models with
more realistic models, the fundamental origins of several such inherent
inefficiency factors can be identified and their magnitudes estimated. In this
manner, the grating recording efficiencies of photorefractive materials can
be directly compared with the fundamental physical limitations imposed by
quantum constraints.

In this chapter, we will establish the absolute quantum efficiency of the
photorefractive grating recording process by deriving the optimum idealized
photorefractive recording model subject to such quantum constraints. A more
realistic photoexcitation and charge transport model applicable to numerous
currently investigated real time photorefractive materials will then be
examined in depth, with emphasis on a comparison with the characteristics of
the optimum quantum limited model. This realistic charge transport model,
based on the extensive previous work of numerous authors, is presented in
such a manner as to illustrate its statistical nature and to provide a physically
intuitive interpretation of its principal attributes.

Of all of the parameters that seek to quantify the absolute or relative
performance of photorefractive materials, one of the most important is the
photorefractive sensitivity (von der Linde and Glass, 1975; Micheron, 1978;
Glass, 1978; Gunter, 1982; Yeh, 1987a and 1987b; Glass et al., 1987;
Valley and Klein, 1983). This key parameter is typically defined in theoretical
analyses either as the refractive index modulation obtained in writing a
uniform grating of fixed spatial frequency per unit absorbed recording energy
density (energy per unit volume) (von der Linde and Glass, 1975; Micheron,
1978; Glass, 1978; Gunter, 1982; Glass et al., 1987; Valley and Klein, 1983),
or as the inverse of the recording energy density required to achieve a
specified value of the diffraction efficiency for a uniform grating of fixed spatial
frequency in a material of given thickness (Valley and Klein, 1983;
Huignard and Micheron, 1976). Alternatively, for purposes of experimental
measurement, the photorefractive sensitivity may be specified as the inverse of
the recording energy density required to reach a given fraction of the
saturation diffraction efficiency of a particular material (Gunter, 1982; Amodei
and Staebler, 1972). The photorefractive sensitivity, and the fundamental
physical limitations that apply to it, are of significant current interest because
they establish the maximum reconfiguration rate of volume holographic
optical elements (VHOEs) (von der Linde and Glass, 1975; Tanguay, 1985)
at constant average optical input power. The maximum reconfiguration rate
of VHOEs is in turn important for applications ranging from massively
parallel interconnections in optical processing and computing systems
(Tanguay, 1985) to the development of the photorefractive
incoherent-to-coherent optical converter (PICOC) (Kamshilin and Petrov, 1980; Shi et al., 1983;
Marrakchi et al., 1985).

A number of factors contribute to the various photorefractive sensitivities
characteristic of photoconductive, electrooptic materials. One such factor is
the photogeneration quantum efficiency, which represents the number of
photogenerated mobile charge carriers per photon absorbed from the
recording beam(s). A second factor is the charge transport efficiency, which
is a measure of the degree to which the average photogenerated mobile charge
carrier contributes to the forming space charge grating after separation from
its original site by means of drift and/or diffusion and subsequent trapping.
The magnitude of the space charge field generated by a given space charge
grating is inversely proportional to the dielectric permittivity $\epsilon$ of the
photorefractive material, which thus contributes a third factor to the grating
recording sensitivity. A fourth factor describes the perturbation of the local
index ellipsoid (dielectric tensor at optical frequencies) that results from a
given space charge field through the electrooptic (Pockels or Kerr) effect.
And finally, a fifth factor pertains to the physical optics inherent in the
readout process, whereby the diffraction efficiency and polarization properties
of the readout beam are derived directly from the index ellipsoid modulation.

In addition, several other physical quantities factor into an evaluation of
the photorefractive sensitivity, including the wavelength of the recording
illumination (to convert the number of absorbed photons into an equivalent
energy), the absorption coefficients of the material at the wavelengths of
both the recording and readout beams (to correct for the fractional
absorbance of the recording beams and the fractional transmittance of the readout
beam), and the magnitude of the applied voltage (which significantly alters
the sensitivity characteristics for certain materials by changing the nature of
the dominant charge transport mechanism from the diffusion regime to the
drift regime).

In previous treatments of the photorefractive sensitivity and its associated
limits, all of the abovementioned factors have been addressed to some
degree, though not necessarily within a single unified treatment, or even within
a consistent set of constraints, assumptions, and approximations. Such a
unification is also beyond the scope of the present work. In this chapter, we
present the results of a study in which we have approached the
photorefractive sensitivity issue from a somewhat different perspective: that of the
absolute quantum efficiency of the photorefractive grating recording process.
As such, we attempt to answer an oft-stated but as yet unanswered question
as to the origin of the apparent insensitivity of photorefractive grating
recording, particularly in comparison with the relatively high sensitivities
characteristic of electrooptic spatial light modulators that utilize a similar
combination of photoconductive charge separation and electrooptic
modulation in the same single crystal materials (Tanguay, 1983).

In order to provide a quantitative metric that does in fact have a
fundamental physical limitation, we define herein the grating recording efficiency
of a photorefractive recording model or configuration as the magnitude of
the space charge field produced by a fixed number of photogenerated mobile
charge carriers at a given spatial frequency, normalized by the maximum
quantum limited space charge field that can be produced by the same number
of photogenerated carriers. This metric thus effectively combines the notions
of a photogeneration efficiency and a charge transport efficiency, and it
provides an estimate of the fundamental quantum efficiency of the
photorefractive grating recording process as determined by the particular
photoexcitation and charge transport mechanism invoked.

In order to fully utilize the concept of the grating recording efficiency,
we first calculate the maximum quantum limited space charge field that can
be produced by a fixed number of photogenerated mobile charge carriers,
assuming optimum photoexcitation and redistribution (transport) functions.
We then calculate the grating recording efficiencies that describe several
important limiting cases with selected idealized photoexcitation and
redistribution functions. This in turn allows for the definition of a baseline case
against which more realistic charge transport models can be compared.
Examination of a particular charge transport model as a test case then indicates
several other generically applicable factors that act to further decrease the
grating recording efficiency in various recording configurations. These
results allow the above analysis to be conveniently utilized for a wide range
of applications of current technological interest.

The organization of the remainder of this chapter is as follows. A brief
summary of the origin, material dependence, and implications of the several
parameters that affect the photorefractive grating recording sensitivity is
provided in the following section (Section 2). Alternative models of the
photorefractive recording process are defined in Section 3, and the appropriate
grating recording efficiencies are presented therein for each case.
Assumptions and limitations common to all of the models are discussed in Section
3.1. Idealized photogeneration and charge transport models are defined and
analyzed in Section 3.2, and more realistic models based upon the analyses
of Young et al. (1974) and Moharam et al. (1979) in the initial stages of
recording before significant space charge field amplitudes evolve, and of
Kukhtarev (1976) for temporal evolution with small modulation depths of
the illumination profile, are discussed and analyzed in Section 3.3.
Representative grating recording efficiency calculations are presented in Section
4 for several common materials and applications in order to illustrate the
effect of the quantum inefficiency factors on the overall quantum efficiency
of photorefractive recording. Finally, conclusions drawn from the above
analyses are discussed in Section 5.


2 Factors Contributing to the Photorefractive Sensitivity

In this section, each of the five factors outlined in the introduction that
collectively determine the photorefractive sensitivity is briefly discussed, in
order to provide a suitable context for the derivation of the quantum limited
grating recording efficiency as presented in the following section. It should
perhaps be emphasized at the outset that although each of the primary factors
considered herein affects at least one of the aforementioned alternative
photorefractive sensitivity parameters, not all of the factors enter into each
defined parameter.

In the context of photorefractive grating recording, the photogeneration
quantum efficiency is related to the fraction of the incident photon flux that
generates mobile charge carriers free to participate in subsequent charge
transport and trapping processes. For a given recording wavelength, several
distinct photoexcitation processes can contribute to the total absorption
coefficient. In most commonly considered models of the photorefractive effect,
the dominant process is the photogeneration of free carriers from deep donor
or acceptor states, such that only one sign mobile carrier (either an electron
or a hole) is liberated for each photoevent. In wavelength regions of
significant photoconductivity, a second important process is the creation of
electron-hole pairs (as well as excitons in certain materials and material
structures) by means of band-to-band transitions. Examples of photoinduced
transitions that are not likely to contribute to the photorefractive effect, and
hence tend to reduce the photogeneration quantum efficiency, are
intersubband absorptions, intraionic level promotions, quantum well interlevel
excitations, and photochromic charge transfer exchanges. Since each
contributing process will in general be characterized by its own charge transport
efficiency (discussed below), it is most appropriate to assign separate
photogeneration quantum efficiencies not only to each charge carrier type, but
also to each distinct photoproduction origin (or photoexcitation channel).

The inherent absorptive inefficiency implied by a finite thickness
photorefractive medium also affects the overall photogeneration quantum
efficiency. In the case of a thin crystal (such that $\alpha d \ll 1$, in which $\alpha$ is the
absorption coefficient at the recording wavelength and $d$ is the crystal
thickness), only a small fraction ($\approx \alpha d$) of the recording beam intensity is
absorbed and hence has an opportunity to participate in the photorefractive
process. In a thicker crystal, for which the thickness may be optimized for
maximum saturation diffraction efficiency, the absorbed fraction is
$(1 - e^{-\alpha d})$ if only the entrance surface of the photorefractive medium is allowed
to achieve saturation. In this case the recorded grating will exhibit an
exponential nonuniformity throughout the crystal thickness that will decrease
the maximum achievable diffraction efficiency. If the entire crystal is
exposed to saturation, the photogeneration quantum efficiency will be further
reduced by a factor of order $e^{-\alpha d}$. Finally, the effects of reflection at both
front and rear crystal surfaces reduce the fraction of incident photons that
contribute to the formation of a given grating component and, hence, also
reduce the effective quantum efficiency. For a crystal thickness optimized
for maximum saturation diffraction efficiency with equal write and read
wavelengths, and for indices of refraction typical of common photorefractive
materials, the combined effects of absorption of the recording beams,
absorption of the readout beam, and reflection losses (assuming uncoated sur-

An additional effect that acts to reduce the photogeneration quantum
efficiency is the constraint imposed indirectly by the nature of the
photoexcitation distribution. As we shall demonstrate in the next section, the
sinusoidal intensity interference pattern generated by two coherent recording
beams is not the optimum (quantum limited) photoexcitation distribution
function.
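
The absorbed fractions quoted in the discussion of crystal thickness can be checked with a small sketch. It assumes simple exponential (Beer-Lambert) attenuation with no reflection losses; the function names and the sample absorption coefficient and thicknesses are hypothetical.

```python
import math


def absorbed_fraction(alpha: float, thickness: float) -> float:
    """Fraction of the recording beam absorbed in the crystal, 1 - exp(-alpha*d)."""
    # -expm1(-x) equals 1 - exp(-x) and stays accurate for small x.
    return -math.expm1(-alpha * thickness)


def thin_crystal_estimate(alpha: float, thickness: float) -> float:
    """Thin-crystal approximation alpha*d, valid only when alpha*d << 1."""
    return alpha * thickness


if __name__ == "__main__":
    alpha = 0.5  # hypothetical absorption coefficient, 1/cm
    for d in (0.1, 1.0, 4.0):  # hypothetical thicknesses, cm
        print(d, absorbed_fraction(alpha, d), thin_crystal_estimate(alpha, d))
```

For small values of $\alpha d$ the two estimates agree closely, while for thicker crystals the linear estimate overstates the absorbed fraction.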

Perhaps the most critical factor that determines the photorefractive
sensitivity is the charge transport efficiency, in that this quantity, above all
others, exhibits the greatest degree of variation among commonly
investigated photorefractive materials. The charge transport efficiency quantifies
the degree to which the average photoproduced charge carrier contributes to
the forming space charge grating following photoexcitation, charge
transport, and subsequent recombination or trapping. Charge transport in the
refractory oxides, as well as in the compound semiconductor family, is a
statistical process in which the net result of a given photoinduced event may
be to increase, decrease, or leave unchanged the magnitude of the space
charge modulation at the fundamental grating frequency.

The statistical nature of the charge transport process can be subsumed in
the standard band transport treatment (Young et al., 1974; Moharam et al.,
1979; Kukhtarev et al., 1976, 1979), or made explicit as in the hopping
conduction model (Feinberg et al., 1980); both approaches lead to
essentially equivalent results (Feinberg et al., 1980; Jaura et al., 1986). The charge
transport efficiency is strongly affected in photorefractive materials by the
dominant conduction mechanism (diffusion and/or drift in an externally
applied field), and by the ratio of the average displacement of a photoexcited
carrier (before recombination or trapping) to the grating spacing. In both the
drift and diffusion regimes, the average displacement depends primarily on
the mobility-lifetime product of the photoexcited species. As such, separate
charge transport efficiencies should be assigned to each carrier type. A
significant net charge transport efficiency will be realized only if there is a net
differential in the carrier displacement and trapping process.
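
The dependence of the average carrier displacement on the mobility-lifetime product can be made concrete with a short sketch. The drift length $\mu\tau E$ and the diffusion length $\sqrt{\mu\tau k_B T/e}$ used here are the standard textbook expressions and are an assumption of this example rather than formulas quoted from the text; all function names and numerical values are hypothetical.

```python
import math

K_B = 1.380649e-23          # Boltzmann constant, J/K
E_CHARGE = 1.602176634e-19  # elementary charge, C


def drift_length(mu_tau: float, field: float) -> float:
    """Mean drift displacement (m) for mobility-lifetime product mu_tau (m^2/V)
    in an applied field (V/m)."""
    return mu_tau * field


def diffusion_length(mu_tau: float, temperature: float = 300.0) -> float:
    """Diffusion length (m), using the Einstein relation D = mu * k_B * T / e."""
    return math.sqrt(mu_tau * K_B * temperature / E_CHARGE)


if __name__ == "__main__":
    mu_tau = 1e-11    # hypothetical mobility-lifetime product, m^2/V
    spacing = 1e-6    # hypothetical grating spacing, m
    print("drift length / spacing:", drift_length(mu_tau, 6e5) / spacing)
    print("diffusion length / spacing:", diffusion_length(mu_tau) / spacing)
```

Comparing either length with the grating spacing indicates, for the assumed values, whether a photoexcited carrier is likely to be retrapped within a small fraction of a grating period or to travel across several periods.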

It should be noted that there are several situations that can yield vanishing
charge transport efficiencies, even with large photogeneration quantum
efficiencies. For example, in a single donor/single trap model, if the initial
Fermi level is more than 10 $k_B T$ or so above the un-ionized donor level, no
substantial charge rearrangement is possible due to the unavailability of
ionized donors (traps) outside the regions of significant photoexcitation.
Likewise, if the initial Fermi level is more than 10 $k_B T$ or so below the
un-ionized donor level, only band-to-band photoexcitations can contribute with
nonvanishing photogeneration quantum efficiencies, which again is likely to
yield negligible charge transport efficiency unless the mobilities and/or
lifetimes are significantly different, or unless operation in the drift regime is
engendered by employing an externally applied electric field.

The space charge grating that results from the combination of
photoexcitation and charge transport processes in turn gives rise to a modulation of
the local electric field at the same spatial frequency through the first
Maxwell equation:

$$\nabla \cdot [\epsilon\,\mathbf{E}(\mathbf{x})] = \frac{\rho(\mathbf{x})}{\epsilon_0}, \tag{3.1}$$

in which $\mathbf{E}$ is the total electric field at each point in space $\mathbf{x}$, $\epsilon$ is the dielectric
permittivity tensor, $\rho$ is the local space charge amplitude, and $\epsilon_0$ is the
dielectric permittivity of free space. Note that the magnitude of the space charge
field derived from a given space charge grating amplitude is inversely
proportional to the grating wave vector, as implied by the differential
relationship expressed in Eq. (3.1). The tensor character of $\epsilon$ is important to note,
as many photorefractive materials (particularly the ferroelectric oxides)
exhibit marked dielectric anisotropy. Hence, space charge gratings oriented in
different directions within the same crystal can give rise to quite large
variations in the resultant space charge field. Note further that the magnitude
of the space charge field scales inversely with a diagonal component of the
dielectric permittivity for a given space charge grating oriented along a
principal dielectric axis of the crystal. Thus, materials with large dielectric
constants (such as BaTiO3 and SBN) require correspondingly large space charge
amplitudes in order to produce an internal electric field modulation of given
amplitude.
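
The inverse scaling with grating wave vector and permittivity noted above follows from integrating the first Maxwell equation in one dimension. As a minimal sketch, assume a sinusoidal space charge $\rho(x) = \rho_0 \cos Kx$ directed along a principal dielectric axis with relative permittivity $\epsilon$, so that the field amplitude is $\rho_0/(\epsilon\epsilon_0 K)$. The function name and test values are hypothetical.

```python
import math

EPSILON_0 = 8.8541878128e-12  # permittivity of free space, F/m


def space_charge_field_amplitude(rho0: float, eps_r: float, spacing: float) -> float:
    """Field amplitude (V/m) produced by rho(x) = rho0 * cos(K x).

    Assumes a one-dimensional grating along a principal dielectric axis:
    d/dx (eps_r * E) = rho / EPSILON_0, so |E| = rho0 / (eps_r * EPSILON_0 * K),
    with K = 2*pi / spacing.
    """
    grating_k = 2.0 * math.pi / spacing
    return rho0 / (eps_r * EPSILON_0 * grating_k)


if __name__ == "__main__":
    rho0 = 1.0       # hypothetical space charge amplitude, C/m^3
    spacing = 1e-6   # hypothetical grating spacing, m
    for eps_r in (50.0, 500.0, 5000.0):
        print(eps_r, space_charge_field_amplitude(rho0, eps_r, spacing))
```

For a fixed space charge amplitude, a tenfold increase in the relative permittivity reduces the field amplitude tenfold, which is the sense in which high-permittivity materials require correspondingly larger space charge amplitudes.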

In the types of photorefractive materials considered herein, the index of
refraction is a function of the local electric field. This dependence can arise
from a number of electrorefractive effects, including among others the linear
electrooptic (Pockels) effect, the quadratic electrooptic (Kerr) effect, the
Franz-Keldysh effect, and the quantum confined Stark effect (Chemla et al., 1985).
In some materials, notably multiple quantum well structures in compound
semiconductors, more than one such electrorefractive effect can contribute
simultaneously to the establishment of the resultant index perturbation. For
our purposes herein, we consider only the linear electrooptic effect, in which
the change in the dielectric impermeability tensor $B$ (the inverse of the
dielectric tensor $\epsilon$) is linear in the electric field:

$$\Delta B_{ij} = r_{ijk} E_k \tag{3.2}$$

in which the Einstein summation rule is implied, and in which $r_{ijk}$ is the
third rank tensor representing the electrooptic coefficient (Kaminow, 1974).
As shown in Eq. (3.2), the tensor nature of the electrooptic effect implies
a dependence of the effective index of refraction on the orientation of the
grating within the crystal, as well as on the direction of propagation of the
readout beam and its polarisation. Since in general one can derive an
effective electrooptic coefficient for a given experimental configuration, Eq.
(3.2) may be rewritten in the form

$$\Delta n_{\mathrm{eff}}(x) = -\tfrac{1}{2}\, n_0^3\, r_{\mathrm{eff}}\, E(x) \tag{3.3}$$


in which the $x$ coordinate is taken parallel to the grating wave vector, $\Delta n_{\mathrm{eff}}(x)$
is the effective index modulation resulting from the combination of the space
charge field and the readout configuration, and $n_0$ is the corresponding
unperturbed index at the wavelength of the readout beam.
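
As a rough numerical illustration of the effective index modulation, the following sketch assumes the conventional linear electrooptic form $\Delta n_{\mathrm{eff}} = -\tfrac{1}{2} n_0^3 r_{\mathrm{eff}} E$. The sign convention, the function name, and the sample values of the index, effective electrooptic coefficient, and field are assumptions made for this example only.

```python
def effective_index_modulation(n0: float, r_eff: float, field: float) -> float:
    """Index change for a given space charge field (V/m).

    Assumed form: delta_n = -0.5 * n0**3 * r_eff * field, with r_eff in m/V.
    """
    return -0.5 * n0 ** 3 * r_eff * field


if __name__ == "__main__":
    n0 = 2.5        # hypothetical unperturbed refractive index
    r_eff = 5e-12   # hypothetical effective electrooptic coefficient, m/V
    field = 1e5     # hypothetical space charge field amplitude, V/m
    print(effective_index_modulation(n0, r_eff, field))
```

The cubic dependence on the unperturbed index means that, under this assumed form, modest differences in refractive index between materials change the index modulation appreciably for the same field and electrooptic coefficient.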

In discussions of the photorefractive sensitivity, it is of considerable value
to combine the effects of the previous two factors, since it has been shown
that for a wide range of common photorefractive materials, the ratio
$n_0^3 r_{\mathrm{eff}}/\epsilon$ exhibits considerably reduced variation compared with that of each
parameter separately (Glass et al., 1984; Glass, 1984). This is indicative of
the general observation that materials with large static polarizabilities
typically also exhibit concomitantly large perturbations of the dielectric tensor
at optical frequencies in response to low frequency applied (or internal)
electric fields.

Once the magnitude of the index perturbation is established, the resultant
diffraction efficiency can be directly determined from the thickness (and
uniformity) of the grating, the grating wave vector, the readout wavelength,
and the corresponding absorption coefficient. Provided that the grating
structure is sufficiently thick to assure diffraction in the Bragg regime and that
the phase and amplitude distortions associated with self-diffraction effects
can be neglected (Kukhtarev et al., 1979; Marrakchi et al., 1987), the
diffraction efficiency is given by (Kogelnik, 1969)

$$\eta = \exp\left(-\frac{\alpha d}{\cos\theta_B}\right)\sin^2\left(\frac{\pi\,\Delta n\,d}{\lambda\cos\theta_B}\right) \tag{3.4}$$


in which $\theta_B$ is the Bragg angle. The first term in this expression denotes the
inherent inefficiency associated with finite absorption at the readout
wavelength, while the second term derives from the grating-modulation-induced
diffraction process. Since for suitably small values of the argument
$\pi\,\Delta n\,d/\lambda\cos\theta_B$ the diffraction efficiency scales as the square of both the index
modulation and the grating thickness, this photorefractive sensitivity factor
is inherently nonlinear and must be utilized with considerable caution.

Until this point in the discussion, we have assumed implicitly that the
grating recording exposures and spatial frequencies employed have been large
enough and small enough, respectively, to keep the photorefractive
recording process outside the additional limitations imposed by quantum statistical
fluctuations. For the recording of photorefractive gratings at very high
spatial frequencies and at very low grating recording exposures, several
additional factors will come into play, including statistical fluctuations in the
photogeneration process, corresponding fluctuations in the charge transport
and trapping processes that yield a locally inhomogeneous charge
distribution with spatial frequency components near that of the grating wave vector,
and concomitant variations in the direction and magnitude of the local space
charge field. These additional factors may act to further reduce the overall
photorefractive sensitivity.


3. The Grating Recording Efficiency

As defined in the introduction (Section 1), the grating recording efficiency
of a given photorefractive recording model or configuration is the ratio
between the magnitude of the space charge field at a given spatial frequency
produced by a fixed number of photogenerated mobile charge carriers, and
the maximum quantum limited space charge field that can be produced by
the same number of photogenerated carriers. The grating recording efficiency
is introduced as a useful metric that effectively compares the
photogeneration and charge transport efficiencies of any given model with the
optimum quantum limited case, and as such provides an estimate of the
overall fundamental quantum efficiency of the photorefractive grating
recording process. Note that since the grating recording efficiency is
normalized, any effect of the dielectric permittivity tensor in establishing
the magnitudes of the space charge fields cancels out. Hence the grating
recording efficiency may be equivalently defined directly in terms of the
space charge grating amplitudes or in terms of the space charge fields for a
dielectric constant of unity.

In this section, idealized photogeneration and charge transport models are
postulated and analyzed in order to determine the maximum quantum limited
space charge amplitude that can be produced by a fixed number of
photogenerated mobile charge carriers at a given spatial frequency. The
grating recording efficiencies of several such idealized models are then
calculated to form a set of baseline cases against which the corresponding
efficiencies of more realistic photorefractive recording models can be
directly compared. For one such model, that of a single mobile charge species
transported between a single type of donor site and its associated (ionized)
trap sites, several factors are identified that contribute to the grating
recording efficiency and that are particularly illustrative of the
fundamental limitations on the photorefractive sensitivity inherent in the
model.

3.1. Constraints Common to Alternative Models of Photorefractive Recording

The idealized photogeneration and charge transport models defined herein
are abstractions of a more realistic model that has been studied by many
authors and has been analyzed in considerable detail by Kukhtarev et al.
(1976, 1979), and as such these models share key assumptions concerning
quantum recording limitations and implications of the readout process. For
simplicity, we confine our attention herein to a version of the model
characterized by a single mobile charge species, and a single type of donor
site with associated un-ionized donor and ionized donor (trap) states,
although more intricate models have been proposed for particular materials
systems [e.g., the existence of mobile holes as well as electrons in lithium
niobate (Orlowski and Kratzig, 1978) and barium titanate (Strohkendl et al.,
1986), or the existence of multiple trap levels in bismuth silicon oxide
(Attard and Brown, 1986; Valley, 1986)].

Four principal material species are considered in the single mobile charge/
single trap level model, as diagrammed in Fig. 3.1. The mobile charge
species (usually electrons) has number density $n(x, t)$ and is represented by
the symbol $e^-$ in Fig. 3.1. It is assumed that negligible densities of these
mobile charges exist under dark conditions; essentially all mobile charges are
created by photoionization of donors. In this model, donors and traps are
assumed to be different valence states of the same impurity atom (e.g., iron
in lithium niobate) or lattice defects, as diagrammed in Fig. 3.2. The total
number of such impurity ions or defects is distributed uniformly throughout
the crystal at potential donor sites. A donor is converted into a trap (ionized
donor), simultaneously with the creation of a mobile charge, by
photogeneration; conversely, a mobile charge is removed from the conduction
band, and a trap is converted into a donor, by recombination (Fig. 3.2). The
sum of the number densities of donors and ionized donors is denoted by $N_D$,
which is therefore the total density of potential donor sites and is assumed
to be constant in space and time. If the number density of ionized donors
is taken to be $N_D^+(x, t)$, then the number density of (un-ionized) donors is
$[N_D - N_D^+(x, t)]$. In Fig. 3.1, the donors are represented by the symbol O
and the traps are represented by the symbol +. A fourth material species is
needed in this model to ensure charge conservation, as the density of ionized
donors $N_D^+(x, t)$ frequently exceeds the density of mobile charges $n(x, t)$
by several orders of magnitude. This fourth species, with number density
$N_A$, has traditionally been called an acceptor, but is also called a charge
compensation site herein and is represented in Fig. 3.1 by the symbol -. The
charge compensation sites are electrically negative with respect to the donors
and are presumed to be negatively charged impurity ions or lattice defects
incorporated during the crystal growth process. The density $N_A$ is assumed
to be constant in space and time, and is further assumed to be numerically
equal to $N_D^+(x, t = 0)$, the concentration of ionized donors in equilibrium.
In point of fact, the only requirement for charge compensation is that the
product of the charge compensation site density and the effective number
of negative charges on each be equal to $N_D^+(x, t = 0)$.
Fig. 3.1. Schematic representation of the single mobile charge/single trap
level photorefractive recording model, indicating both the uniform
distributions and the relative densities of the four principal material
species involved prior to grating recording.

Fig. 3.2. Schematic representation of the single active species
photorefractive recording model, indicating the electron photogeneration (and
recombination) processes involved in the transformation of donors into traps
(and traps into donors).

It is instructive to consider the relative densities of these various species.
Consider, for example, a crystal of bismuth silicon oxide (Bi$_{12}$SiO$_{20}$).
Typical concentrations of donors, charge compensation sites, and mobile charges
under reasonable optical intensities are of order 10'* cm$^{-3}$, 10'* cm$^{-3}$,
and no more than 10'* cm$^{-3}$, respectively (Peltier and Micheron, 1977; see
also Hou et al., 1973). (The number density of electrons under dark conditions
for typical crystals of bismuth silicon oxide is entirely negligible.) Thus,
$N_D \gg N_D^+ \approx N_A \gg n$. A number of other photorefractive media
exhibit similar proportionalities. For these materials, the total space charge
density $\rho(x, t)$ is given by

$$\rho(x, t) = e\left[N_D^+(x, t) - n(x, t) - N_A\right] \approx e\left[N_D^+(x, t) - N_A\right]. \qquad (3.5)$$

The local density of ionized donors $N_D^+$ can be spatially redistributed
under the influence of inhomogeneous photogeneration, as shown by comparing
Figs. 3.1 and 3.3. This redistribution occurs as follows. A photon is
absorbed by a donor, converting the donor into a trap (ionized donor) and
generating a mobile charge carrier. The mobile charge carrier is transported
some distance through the photorefractive medium due to drift and/or
diffusion, and it is subsequently captured by a trap, thus generating another
donor. If we choose to follow a particular electron, the entire process can
be viewed as a simple exchange between equivalent donor sites of a donor
state with an ionized donor (or trap) state.

As a final note, the space charge profile $\rho(x, t)$ might evolve into a
highly distorted profile with respect to the distribution of incident
illumination because of nonlinearities inherent in the recording process.
However, since the grating readout is typically performed deep within the
Bragg regime, at most




R.V.  Johnson  and  A.R.  Tanguay,  Jr. 


Fig. 3.3. Schematic representation of the single mobile charge/single trap
level photorefractive recording model, indicating the spatial redistribution
of the four principal material species following grating recording. Note that
the sum of the donor and trap densities is space-invariant. The symbols
identifying each species are given in Fig. 3.1.

a very limited range of spatial frequencies is effective in diffracting the
readout light. Thus only one spatial harmonic of the space charge field is of
interest, which is assumed herein to be the fundamental harmonic.

In summary, the photorefractive recording model studied by Kukhtarev
and all idealized photogeneration and charge transport abstractions
considered in the following analysis share these basic assumptions: a) only a
finite amount of space charge ($N_D^+(x, t = 0) = N_A$) exists for generating
the space charge electric field, b) this space charge can be spatially
redistributed under the influence of an illumination pattern, c) no more than
one mobile charge is generated for each absorbed photon of the illumination
beam, and d) only the fundamental spatial harmonic of the space charge is
effective in the holographic readout process. The photorefractive recording
model and the idealized models differ, however, in the details of the
photogeneration and charge transport processes, as discussed next.

3.2. Idealized Photogeneration and Charge Transport Models

The grating recording efficiency comprises three successive physical
processes: photogeneration, charge transport, and trapping. In order to
determine the maximum quantum limited space charge field (at unity dielectric
constant) that can result from a fixed number of photogenerated mobile charges,
we investigate four idealized photogeneration and charge transport models,
as defined below.

The photogeneration process is controlled by the recording illumination
profile. Two alternative profiles are considered herein, a periodic comb
function and a sinusoidal function. The comb illumination profile $I_C(x)$, as
shown in Fig. 3.4, is defined by

$$I_C(x) = I_{C0}\,\mathrm{comb}\left(\frac{x}{\Lambda_G}\right) \rightarrow I_{C0}\sum_n \delta\left(\frac{x}{\Lambda_G} - n\right), \qquad (3.6)$$

in which the arrow $\rightarrow$ indicates that the desired function
asymptotically approaches a sequence of Dirac delta functions, i.e., a series
of intensity peaks with spatial extent small compared with the spatial period
$\Lambda_G$. Unlike a mathematical delta function, however, the intended comb
peaks are assumed to be large enough to overlap a reasonable number of donor
sites. Note that the periodic comb function does not correspond to any normal
recording configuration. This is acceptable for purposes of this analysis
because the maximum quantum limited space charge field at unity dielectric
constant is intended to define an upper limit against which more realistic
models might be compared.

Fig. 3.4. Schematic representation of the two principal illumination profiles
(comb and sinusoid functions) utilized in the idealized photogeneration and
charge transport models. The comb function is illustrated with finite width to
incorporate a given number of donors at a predetermined donor density.

The sinusoid illumination profile $I_S(x)$ of unity modulation depth is
defined by


$$I_S(x) = I_{S0}\left[1 + \cos\left(\frac{2\pi x}{\Lambda_G}\right)\right], \qquad (3.7)$$


in which $\Lambda_G$ is the spatial period of the illumination profile. To
compare these two illumination profiles, the same photon flux is assumed; that
is, the scaling parameters $I_{S0}$ and $I_{C0}$ are adjusted such that the
following normalization integral is satisfied:

$$\int_{-\Lambda_G/2}^{\Lambda_G/2} I_C(x)\,dx = \int_{-\Lambda_G/2}^{\Lambda_G/2} I_S(x)\,dx, \qquad (3.8)$$

from which we derive the relation that $I_{S0} = I_{C0} = I_0$. In the
idealized photogeneration models, all photons in the illumination profile are
assumed to be absorbed and to generate mobile charge carriers.
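The equal-flux normalization can be checked numerically. The sketch below is only an illustration under stated assumptions: the comb is approximated by narrow rectangular peaks (the peak width, the unit period, and all function names are hypothetical choices), and the sinusoid is taken to have unity modulation depth in the form I_S0 [1 + cos(2 pi x / Lambda_G)].

```python
import math


def comb_profile(x, period, width, i_c0):
    """Narrow rectangular peaks standing in for I_C0 * comb(x / period).

    Each peak has area i_c0 * period, the area carried by one delta function
    of comb(x / period).
    """
    offset = (x + period / 2.0) % period - period / 2.0
    return i_c0 * period / width if abs(offset) < width / 2.0 else 0.0


def sinusoid_profile(x, period, i_s0):
    """Sinusoidal profile of unity modulation depth."""
    return i_s0 * (1.0 + math.cos(2.0 * math.pi * x / period))


def flux_per_period(profile, period, samples=20000):
    """Midpoint-rule integral of profile(x) over one period centered on x = 0."""
    dx = period / samples
    return sum(profile(-period / 2.0 + (k + 0.5) * dx) for k in range(samples)) * dx


PERIOD = 1.0  # grating period in arbitrary length units (hypothetical)
I0 = 1.0      # common scaling parameter, I_S0 = I_C0 = I_0

comb_flux = flux_per_period(lambda x: comb_profile(x, PERIOD, PERIOD / 100.0, I0), PERIOD)
sinusoid_flux = flux_per_period(lambda x: sinusoid_profile(x, PERIOD, I0), PERIOD)
print(comb_flux, sinusoid_flux)
```

With equal scaling parameters the two profiles deliver the same integrated intensity per period, so any difference in the resulting space charge field reflects the spatial distribution of the photons rather than their number.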

Two alternative models of charge transport and trapping are also considered:
half wavelength translation, and randomization (Figs. 3.5 and 3.6). In
the half wavelength translation process shown schematically in Fig. 3.5,
each electron is assumed to translate precisely half the grating period of the
illumination profile before capture, without diffusive blooming of the
electron cloud. In the analysis that follows, it will be shown that half
wavelength translation corresponds to the quantum limited charge transport
process, giving rise to the maximum grating recording efficiency. The second
idealized transport and capture process to be considered herein is complete
randomization, as shown schematically in Fig. 3.6, in which the photogenerated
electrons are assumed to redistribute randomly until they exhibit no spatial
variation in density, and only then are recaptured by local traps. The
randomization model is intended to represent a more realistic charge transport
model; even so, this model corresponds to an upper bound on realizable
charge transport efficiencies, as shown in the next section.

Fig. 3.5. Schematic representation of the two principal charge redistribution
mechanisms (half wavelength translation and randomization) utilized in the
idealized photogeneration and charge transport models, for the case of comb
illumination. A single mobile charge species (electrons) is assumed for
purposes of illustration.

Fig. 3.6. Schematic representation of the two principal charge redistribution
mechanisms (half wavelength translation and randomization) utilized in the
idealized photogeneration and charge transport models, for the case of
sinusoidal illumination.

Combination of the two photogeneration models and the two charge transport
and trapping models produces four alternative idealized recording models
for comparison (Fig. 3.7), which will hereinafter be identified as the bipolar
comb, the monopolar comb, the transport-efficient sinusoid, and the baseline
sinusoid. Each of the four combinations will be considered in turn. Grating
recording efficiencies are calculated for these four combinations in Section
3.2.1, and the corresponding space-charge-field saturation behavior is
considered in Section 3.2.2.
                                 Illumination Profile
Charge Transport                 Comb              Sinusoid
Half wavelength translation      Bipolar comb      Transport-efficient sinusoid
Randomization                    Monopolar comb    Baseline sinusoid

Fig. 3.7. Matrix representation of the four idealized photogeneration and
charge transport models, as derived from the principal illumination profiles
and charge transport mechanisms.


3.2.1. Idealized Grating Recording Efficiencies

The bipolar comb results from a periodic comb illumination function
combined with a half wavelength translation. The resulting space charge density
$\rho(x)$ induced by this illumination and transport combination is a periodic
comb function superimposed on a periodic comb function of opposite sign
shifted by half of a grating wavelength, as shown in Fig. 3.8. For a given
peak photogenerated charge density $\rho_0$ (proportional to $I_0\tau$, in
which $\tau$ is the exposure time), the resultant space charge distribution
$\rho(x)$ is given by

$$\rho(x) = \rho_0\left[\mathrm{comb}\left(\frac{x}{\Lambda_G}\right) - \mathrm{comb}\left(\frac{x}{\Lambda_G} - \frac{1}{2}\right)\right]. \qquad (3.9)$$

This space charge profile $\rho(x)$ can be readily integrated [see Eq. (3.1)]
to yield a space charge field $E(x)$ that is a square pulse train (Fig. 3.8).
The first spatial harmonic component, $E_1$, defined by

$$E_1 = 2\Lambda_G^{-1}\int_0^{\Lambda_G} E(x)\,\sin\left(\frac{2\pi x}{\Lambda_G}\right)dx, \qquad (3.10)$$

has a magnitude of

$$E_1 = 4e\,\frac{\rho_0}{\epsilon\epsilon_0 K_G}, \qquad (3.11)$$

in which $K_G = 2\pi/\Lambda_G$ is the wave vector of the illumination profile.
Fig. 3.8. Space charge density and field profiles for the bipolar comb
distribution. The first spatial harmonic of the space charge field is
represented by the dashed curve and is scaled to the magnitude of the total
space charge field.
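The step from the square pulse train to its first spatial harmonic can be reproduced numerically. The following Python sketch applies the Fourier projection of Eq. (3.10) to a zero-mean square wave whose jump at each comb peak equals the sheet charge of that peak divided by the permittivity. The normalized units (e = 1, permittivity = 1, unit period), the zero-mean choice of integration constant, and the function names are assumptions made for illustration.

```python
import math


def bipolar_field(x, period, rho0, e=1.0, eps=1.0):
    """Square-wave field produced by the bipolar comb charge distribution.

    Each comb peak carries a sheet charge e * rho0 * period, so the field
    jumps by e * rho0 * period / eps at x = 0 and drops by the same amount
    at x = period / 2. A zero-mean field is assumed.
    """
    amplitude = e * rho0 * period / (2.0 * eps)
    return amplitude if (x % period) < period / 2.0 else -amplitude


def first_harmonic(field, period, samples=20000):
    """Eq. (3.10): (2 / period) * integral of E(x) sin(2 pi x / period) over one period."""
    dx = period / samples
    total = 0.0
    for k in range(samples):
        x = (k + 0.5) * dx  # midpoint rule
        total += field(x) * math.sin(2.0 * math.pi * x / period)
    return 2.0 * total * dx / period


PERIOD = 1.0  # hypothetical grating period (arbitrary units)
RHO0 = 1.0    # hypothetical peak photogenerated charge density
K_G = 2.0 * math.pi / PERIOD

e1_numeric = first_harmonic(lambda x: bipolar_field(x, PERIOD, RHO0), PERIOD)
e1_closed_form = 4.0 * RHO0 / K_G  # 4 e rho0 / (eps K_G) with e = eps = 1
print(e1_numeric, e1_closed_form)
```

The numerical projection agrees with the closed form because the fundamental of a square wave of amplitude A is 4A/pi, and here A equals e rho0 Lambda_G / (2 eps).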


The monopolar comb commands interest because it can support the highest
space charge field before saturation due to limited ionized trap density,
as discussed in the next section. The monopolar comb results from a periodic
comb illumination function combined with electron randomization. The
resulting space charge profile $\rho(x)$ induced by this illumination and
transport combination is a periodic comb function superimposed on a uniform
background of opposite charge, as shown in Fig. 3.9. The resulting charge
distribution is represented by

$$\rho(x) = \rho_0\left[\mathrm{comb}\left(\frac{x}{\Lambda_G}\right) - 1\right]. \qquad (3.12)$$

The corresponding space charge field exhibits a sawtooth profile (Fig. 3.9),
and the magnitude $E_1$ of its first spatial harmonic is

$$E_1 = 2e\,\frac{\rho_0}{\epsilon\epsilon_0 K_G}. \qquad (3.13)$$

The transport-efficient sinusoid results from a sinusoidal illumination
profile combined with a half wavelength translation, as shown schematically in
Fig. 3.10, leading to a charge density $\rho(x)$ given by

$$\rho(x) = 2\rho_0\cos\left(\frac{2\pi x}{\Lambda_G}\right), \qquad (3.14)$$

and a first spatial harmonic field component $E_1$ of

$$E_1 = 2e\,\frac{\rho_0}{\epsilon\epsilon_0 K_G}. \qquad (3.15)$$

The transport-efficient sinusoid combination is included for completeness,
but is not emphasized because it is neither realistic nor does it correspond
to any upper bound of grating recording efficiency or saturation performance.

The baseline sinusoid, although seemingly artificial, is of very pronounced
interest because it represents an asymptotic upper bound on the
grating recording efficiency predicted by more realistic recording models,
as discussed in Section 3.3 below. The baseline sinusoid results from a
sinusoidal illumination profile combined with randomization of the electron
distribution. The resulting space charge distribution exhibits a sinusoidal
profile, as does the corresponding space charge field, as shown in Fig. 3.10.
The charge density is given by

$$\rho(x) = \rho_0\cos\left(\frac{2\pi x}{\Lambda_G}\right), \qquad (3.16)$$

corresponding to a first spatial harmonic field component $E_1$ of

$$E_1 = e\,\frac{\rho_0}{\epsilon\epsilon_0 K_G}. \qquad (3.17)$$

Fig. 3.9. Space charge density and field profiles for the monopolar comb
distribution. The first spatial harmonic of the space charge field is
represented by the dashed curve and is scaled to the magnitude of the total
space charge field.

Fig. 3.10. Space charge density and field profiles for the transport-efficient
(first scale values) and baseline sinusoid (second scale values)
distributions. The space charge fields are scaled to the magnitude of the
transport-efficient case.

As can be seen from the analysis above, the bipolar comb photogeneration/
charge transport combination generates the maximum quantum limited
space charge field at unity dielectric constant, and hence provides the
normalization constant required for evaluation of the grating recording
efficiencies of both idealized and realistic photorefractive recording models.
Physically, this optimum combination of photogeneration and charge transport
results from the maximum possible average separation of the positive
and negative charge distributions, as well as from the fact that the amplitude
of the first harmonic of a square wave exceeds the amplitude of the square
wave itself.

The grating recording efficiencies for the four idealized photorefractive
grating recording combinations are shown in Table 3.1. The most
quantum-efficient recording occurs for the bipolar comb, which therefore is
assigned a grating recording efficiency of unity. The next most efficient
configurations are the monopolar comb and the transport-efficient sinusoid,
which both yield a grating recording efficiency of 0.5. The least efficient
configuration is the baseline sinusoid, with a grating recording efficiency of
0.25. As we shall show later, the efficiency of the baseline sinusoid is an
asymptotic upper limit of more realistic recording models. Table 3.2 gives the
relative photorefractive grating recording sensitivities for these four cases
in terms of diffraction efficiency per unit incident photon flux, assuming low
diffraction efficiencies such that the efficiency is proportional to the square
of the space charge field component $E_1$. Note that by this measure the
sensitivity of the baseline sinusoid is degraded by a factor of 16, over an
order of magnitude, compared with the quantum limit represented by the bipolar
comb charge distribution function.
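The efficiencies and relative sensitivities quoted above can be reproduced with a short numerical sketch. Because the first harmonic of the field is proportional to the first harmonic of the charge density divided by the common factor (permittivity times K_G), only the fundamental Fourier amplitude of each charge profile is needed. The narrow rectangular peaks used in place of delta functions, the peak width, and all names below are illustrative assumptions.

```python
import math

WIDTH = 1e-3  # hypothetical comb peak width as a fraction of the grating period


def peak(u, center):
    """Unit-area rectangular peak in u = x / period, standing in for a delta function."""
    offset = (u - center + 0.5) % 1.0 - 0.5
    return 1.0 / WIDTH if abs(offset) < WIDTH / 2.0 else 0.0


def sinusoid(u):
    """Sinusoidal photogeneration profile of unity modulation depth."""
    return 1.0 + math.cos(2.0 * math.pi * u)


# Charge profiles in units of rho0: ionized donors minus recaptured electrons.
MODELS = {
    "bipolar comb": lambda u: peak(u, 0.0) - peak(u, 0.5),
    "monopolar comb": lambda u: peak(u, 0.0) - 1.0,
    "transport-efficient sinusoid": lambda u: sinusoid(u) - sinusoid(u - 0.5),
    "baseline sinusoid": lambda u: sinusoid(u) - 1.0,
}


def first_harmonic_amplitude(rho, samples=40000):
    """Amplitude of the fundamental Fourier component of rho(u) over one period."""
    du = 1.0 / samples
    c = s = 0.0
    for k in range(samples):
        u = (k + 0.5) * du  # midpoint rule
        value = rho(u)
        c += value * math.cos(2.0 * math.pi * u)
        s += value * math.sin(2.0 * math.pi * u)
    return 2.0 * du * math.hypot(c, s)


amplitudes = {name: first_harmonic_amplitude(rho) for name, rho in MODELS.items()}
reference = amplitudes["bipolar comb"]

# Grating recording efficiency: field harmonic normalized to the bipolar comb.
efficiency = {name: a / reference for name, a in amplitudes.items()}
# Low-efficiency diffraction sensitivity scales as the square of the field.
sensitivity = {name: eff ** 2 for name, eff in efficiency.items()}

for name in MODELS:
    print(f"{name:30s} {efficiency[name]:.3f} {sensitivity[name]:.4f}")
```

Under these assumptions the sketch returns efficiencies close to 1, 0.5, 0.5, and 0.25, and squared values close to 1, 1/4, 1/4, and 1/16, in line with the figures stated in the text.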

3.2.2.  Space  Charge  Saturation  for  the  Idealized  Models 

Not only is the photorefractive recording sensitivity of concern, but also
saturation limitations of the space charge field occurring because of limited
ionized trap density. Consider, for example, the bipolar comb example shown
in Fig. 3.8. The regions of positive space charge grow by the photoexcitation
of neutral donors, which converts them into positively ionized donors (traps)
and generates mobile electrons. The electrons are removed from this region
by the various transport processes, leaving behind the positively ionized
donors. As will be shown later, it is these regions of net positive space
charge that primarily contribute to the buildup of the space charge field. The
regions of negative space charge grow by the reverse process, i.e., by
capturing mobile electrons at local ionized donor sites to form neutral donors.
In photorefractive crystals that initially are in quasi-thermodynamic
equilibrium, the density of donors typically far exceeds the density of ionized
donors, which implies that the regions of negative space charge will saturate
first. This corresponds to the complete conversion of all ionized donors into
neutral donors at the peaks of the negative space charge distribution,
resulting in the local complete cancellation of $N_D^+(x, t = 0) = N_A$, as
shown schematically in Fig. 3.11.

The bipolar comb combination exhibits by far the most rapid charge
saturation at the lowest space charge field of the four combinations considered
herein, because for this combination the electrons after transport are
concentrated into a very small volume, with a correspondingly small number
of ionized donors available to capture these electrons. Conversely, the
monopolar comb combination exhibits the slowest saturation at the highest space
charge field because the electrons are uniformly distributed throughout the
volume of the photorefractive medium and, hence, can be captured by
all of the ionized donors. The saturation characteristics of the sinusoidal
combinations are intermediate between the bipolar and the monopolar
combinations.

Fig. 3.11. Saturation of the space charge modulation for the bipolar comb,
monopolar comb, and sinusoid charge distributions as a result of the finite
available trap density. An initially uniform trap density of $N_D^+(0)$ is
assumed prior to photogeneration and charge redistribution.

The relative photorefractive grating recording sensitivities (grating
recording efficiencies) and saturation characteristics of these four
photogeneration/charge transport combinations are schematically diagrammed in
Fig. 3.12. The photosensitivities are indicated by the initial linear slopes
and the saturation by the final space charge field levels. The space charge
field is plotted here in units of $E_q$, which is defined as
$eN_A/\epsilon\epsilon_0 K_G$ (Amodei, 1971). (This expression is valid when
the density of charge compensation sites $N_A$ is much smaller than the total
density of potential donor sites $N_D$.) Note that the saturation level for the
bipolar comb should really be much closer to the horizontal axis; it has been
overstated for clarity of illustration.
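The ordering of the saturation levels can be estimated with a rough sketch, assuming that saturation sets in when the largest local density of recaptured electrons equals the available ionized donor density N_A, and that the field harmonic per unit rho0 takes the idealized values 4, 2, 2, and 1 obtained above. The comb peak width is a hypothetical parameter, and the resulting numbers are estimates under these assumptions rather than values read from Fig. 3.12.

```python
def saturation_field(harmonic_per_rho0, peak_trapped_per_rho0):
    """Estimated E_1 / E_q at saturation.

    harmonic_per_rho0     : first-harmonic field in units of e * rho0 / (eps * eps0 * K_G)
    peak_trapped_per_rho0 : largest local density of recaptured electrons, in units of rho0

    Saturation is assumed when the peak recaptured density reaches N_A, so the
    largest usable rho0 is N_A / peak_trapped_per_rho0 and the field follows
    in units of E_q = e * N_A / (eps * eps0 * K_G).
    """
    return harmonic_per_rho0 / peak_trapped_per_rho0


COMB_FILL = 0.01  # hypothetical ratio of comb peak width to grating period

saturation = {
    # electrons recaptured inside a narrow peak: local density rho0 / COMB_FILL
    "bipolar comb": saturation_field(4.0, 1.0 / COMB_FILL),
    # electrons spread uniformly: local density rho0 everywhere
    "monopolar comb": saturation_field(2.0, 1.0),
    # electrons follow a shifted sinusoid rho0 * (1 - cos): peak density 2 * rho0
    "transport-efficient sinusoid": saturation_field(2.0, 2.0),
    # electrons spread uniformly: local density rho0 everywhere
    "baseline sinusoid": saturation_field(1.0, 1.0),
}

for name, level in saturation.items():
    print(f"{name:30s} E_1/E_q at saturation ~ {level:.2f}")
```

Under these assumptions the monopolar comb saturates highest, the two sinusoidal cases saturate near E_q, and the bipolar comb saturates far below E_q when the comb peaks are narrow, which matches the qualitative ordering described in the text.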

Having calculated the grating recording efficiencies and saturation fields
for these four highly idealized cases, we now proceed to compare these ideal
results with more realistic photogeneration and charge transport models in
the next two sections.


3.3.  Realistic  Recording  Models 

The idealized photorefractive recording models discussed above allow the
effects of the photogeneration profile and charge transport process on the
grating recording efficiency to be assessed relative to the fundamental
quantum limits. We now examine a more realistic model applicable to a wide
range of commonly investigated photorefractive media, which exhibits an
overall efficiency degraded from that of the baseline sinusoid case presented
above by several additional factors. In this model, the photogeneration
profile is assumed to be sinusoidal, and the effects of the photogeneration
quantum efficiency, absorption coefficients, and reflection losses are assumed
to be space-invariant efficiency factors and hence can be directly incorporated
in any estimate of the photorefractive grating recording sensitivity. Another
major factor is an inherent inefficiency in the charge transport process; this
inefficiency is studied in Section 3.3.1 using analytical solutions derived by
Young et al. (1974; see also Moharam et al., 1979), which are valid in the
initial recording interval before significant space charge fields have evolved.
A related factor derives from the reduced recording sensitivity exhibited as
the space charge field approaches its steady state limit. This is studied in
Section 3.3.2 using analytical solutions derived by Kukhtarev (1976) that
describe the temporal evolution of the space charge field in the limit of very
low illumination profile modulation depths. Recording configurations that
generate low modulation depths are inherently inefficient, as most of the
photons in the illumination contribute a uniform background photocurrent,
and only a fraction of the incident intensity contributes to the spatial
structure of the image. The modulation depth, therefore, is also a factor that
reduces the photorefractive grating recording sensitivity. However, low
modulation depths are necessary in certain recording techniques for
enhancing the space charge field in the steady state limit, such as the running
grating process discussed in Section 3.3.3.

3.3.1. Initial Recording Sensitivity

To  determine  the  existence  of  degraded  charge  transport  efficiency,  the 
idealized  recording  models  discussed  in  Section  3.2  must  be  compared  with 
more  realistic  charge  transport  solutions,  such  as  those  given  by  Young  et 
al. (1974; see also Moharam et al., 1979). Under normal recording conditions,
the coupled photorefractive recording equations are nonlinear, making
analytical solutions difficult or impossible to derive. However, a
significantly simplified analysis can be utilized during the initial recording period,
which  enables  analytical  solutions  to  be  derived  at  least  for  certain  illumi¬ 
nation  profiles.  These  analytical  solutions  are  well  worth  studying  for  the 
physical  insight  they  furnish  into  the  charge  transport  process. 

The analytical simplification described above derives from a linearization
of the recording equations, in which two recording parameters, the total
electric field and the ionized donor (trap) density, remain essentially
constant throughout space during the initial recording interval (Young et al.,
1974; Moharam et al., 1979). This assumes that the photorefractive crystal
is initially in quasi-thermodynamic equilibrium, i.e., with a spatially
uniform distribution of ionized donors $N_D^+(x, t = 0)$, and that the trap
density remains essentially constant throughout this initial recording interval.

The  analytical  solutions  that  exist  in  this  regime  have  typically  empha¬ 
sized  recording  with  sinusoidal  illumination  profiles.  While  such  a  profile 
corresponds  closely  with  typical  experimental  situations  and  simplifies  the 
mathematics,  the  physics  of  the  transport  process  is  somewhat  obscured  in 
comparison  with  an  alternative  illumination  profile,  that  of  a  very  narrow 
slit,  which  can  be  viewed  as  an  approximation  of  a  Dirac  delta  function. 
Typical  trapped  electron  density  profiles  obtained  in  response  to  a  narrow 
slit  illumination  profile  are  shown  in  Fig.  3.13  for  the  cases  of  diffusion- 
only  transport  (top  illustration),  drift-only  transport  (middle  illustration),  and 
one  particular  combination  of  drift  and  diffusion  processes  (bottom  illustra¬ 
tion);  the  derivation  of  these  figures  is  described  below.  These  figures  em¬ 
phasize  several  important  features  of  a  more  realistic  transport  analysis.  The 
transport  mechanism  is  inherently  a  random  process,  with  a  spread  in  char¬ 
acteristic  transport  lengths  associated  with  a  corresponding  spread  in  charge 
carrier lifetimes. Useful parameters for characterizing the transport processes
are the average transport lengths $L_E$ for drift-induced transport and $L_D$ for
diffusive transport, defined as (Young et al., 1974; Moharam et al., 1979)

$$L_E = \mu \tau E_0 \tag{3.18}$$

and

$$L_D = (D\tau)^{1/2} = \left(\frac{k_B T}{e}\,\mu\tau\right)^{1/2}, \tag{3.19}$$

in which $\mu$ is the mobility of the charge carriers, $E_0$ is the applied bias
electric field, $\tau$ is the charge carrier lifetime, $D$ is the diffusion coefficient
for the charge carriers, $k_B$ is Boltzmann's constant, $T$ is the crystal
temperature, and $e$ is the charge of an electron. Einstein's relation between the
diffusion coefficient and the mobility has been used in Eq. (3.19). Note that
in both cases the transport lengths are functions of the $\mu\tau$ product.

Fig. 3.13. Typical trapped electron density profiles obtained in response to a
narrow slit illumination profile. The average transport lengths $L_D$ for
diffusive transport and $L_E$ for drift-induced transport are defined in the text.
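As a rough numerical illustration of the two transport lengths, the sketch below evaluates them for bismuth silicon oxide. It assumes the forms $L_E = \mu\tau E_0$ and $L_D = (k_B T \mu\tau / e)^{1/2}$, the mobility-lifetime product of 15 μm²/V listed in Table 3.3, and the 6 kV/cm bias field used later in the chapter. The crystal temperature of 300 K is an assumption (the text does not state one), and the function names are illustrative only.

```python
import math

K_B = 1.380649e-23          # Boltzmann constant, J/K
E_CHARGE = 1.602176634e-19  # electron charge, C

def drift_length(mu_tau, bias_field):
    """Drift transport length L_E = mu*tau*E_0 (m^2/V times V/m gives m)."""
    return mu_tau * bias_field

def diffusion_length(mu_tau, temperature=300.0):
    """Diffusion length L_D = sqrt(D*tau), with D = mu*k_B*T/e (Einstein relation)."""
    return math.sqrt(K_B * temperature / E_CHARGE * mu_tau)

mu_tau_bso = 15e-12   # 15 um^2/V expressed in m^2/V
bias_field = 6e5      # 6 kV/cm expressed in V/m

l_e = drift_length(mu_tau_bso, bias_field)
l_d = diffusion_length(mu_tau_bso)   # assumed T = 300 K

print(f"L_E = {l_e * 1e6:.2f} um")
print(f"L_D = {l_d * 1e6:.2f} um")
```

Under these assumptions the drift length comes out at 9 μm, while the diffusion length is well under 1 μm, which illustrates why an applied bias field is attractive in this material.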

For the diffusion-only case, the electron spread is symmetrical, which
introduces  no  net  phase  shift  in  the  photorefractive  response  to  any  arbitrary 
illumination  profile.  For  the  drift  case,  as  well  as  for  the  combined  drift/ 
diffusion  case,  the  electron  distribution  is  skewed  to  one  side  by  the  presence 
of  an  applied  bias  field,  which  does  in  fact  introduce  a  phase  shift  when 
recording  particular  illumination  profiles,  as  shown  in  Fig.  3.13. 

Now consider an illumination profile $I(x)$ that is sinusoidal and of the form

$$I(x) = I_0\left[1 + m\cos(K_G x)\right], \tag{3.20}$$

in which $m$ is the modulation depth of the light profile and $K_G$ is the wave
vector associated with the interference pattern. For such an illumination
profile, Young et al. (1974; see also Moharam et al., 1979) have derived
expressions for the growth of the space charge field during the initial
recording interval. When transport is dominated by diffusion, the initial growth
of the first harmonic component $E_1$ of the space charge field is expressed
by

$$E_1 = m\left[\frac{t e g_0}{\epsilon\epsilon_0 K_G}\right]\left[\frac{K_G^2 L_D^2}{1 + K_G^2 L_D^2}\right], \tag{3.21}$$

in which $g_0$ is the photogeneration rate and $t$ is the time relative to the
initiation of grating recording. The initial growth when drift transport
dominates is expressed by

$$E_1 = m\left[\frac{t e g_0}{\epsilon\epsilon_0 K_G}\right]\left[K_G L_E\left(1 + K_G^2 L_E^2\right)^{-1/2}\right], \tag{3.22}$$

with an associated phase shift $\phi$ defined by

$$\tan\phi = K_G L_E. \tag{3.23}$$

The first bracketed term, $[t e g_0/(\epsilon\epsilon_0 K_G)]$, in Eqs. (3.21) and (3.22)
corresponds to the grating recording efficiency predicted by the baseline sinusoid
model (with $\rho_0 = t e g_0$), as discussed in Section 3.2. This represents the
upper bound on achievable recording sensitivity. The second bracketed terms
in Eqs. (3.21) and (3.22) correspond to an additional charge transport
inefficiency inherent in more realistic models of photorefractive recording, the
subject of this section. This transport inefficiency factor is plotted as a
function of increasing transport length in Fig. 3.14 for diffusion-only transport
and in Fig. 3.15 for drift-only transport. Recall that these curves apply only
during the initial recording interval, before significant space charge has
accrued. Later recording will be characterized by a lower transport efficiency
because of the presence of the space charge field. Note in Figs. 3.14 and
3.15 that the charge transport efficiency asymptotically approaches its
maximum value in the limit of very long transport lengths, as intuitively
expected, although even in this limit the maximum recording sensitivity is that
of the baseline sinusoid, not that of the transport-efficient sinusoid.
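The behavior of the transport inefficiency factor can be sketched numerically. The example below assumes that the second bracketed terms take the forms $K_G^2 L_D^2/(1 + K_G^2 L_D^2)$ for diffusion and $K_G L_E (1 + K_G^2 L_E^2)^{-1/2}$ for drift, and tabulates both against the transport length scaled to the grating period, using $K_G L = 2\pi L/\Lambda_G$. The function names and the sample ratios are illustrative choices, not values taken from the figures.

```python
import math

def diffusion_efficiency(kl):
    """Assumed diffusion-only transport factor: (K L_D)^2 / (1 + (K L_D)^2)."""
    return kl ** 2 / (1.0 + kl ** 2)

def drift_efficiency(kl):
    """Assumed drift-only transport factor: K L_E / sqrt(1 + (K L_E)^2)."""
    return kl / math.sqrt(1.0 + kl ** 2)

# Transport length scaled to the grating period (illustrative sample points).
ratios = [0.01, 0.1, 0.5, 1.0, 3.0]

rows = []
for ratio in ratios:
    kl = 2.0 * math.pi * ratio   # K_G * L = 2*pi*L / Lambda_G
    rows.append((ratio, diffusion_efficiency(kl), drift_efficiency(kl)))

print(" L/Lambda   diffusion   drift")
for ratio, eta_diff, eta_drift in rows:
    print(f"{ratio:8.2f}   {eta_diff:9.4f}   {eta_drift:7.4f}")
```

Both factors rise monotonically toward unity and never exceed it, which is the sense in which the baseline sinusoid acts as an upper bound in this regime.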

The  reason  that  the  recording  sensitivity  only  reaches  the  baseline  sinusoid 
level  in  this  limit  is  best  understood  by  considering  the  spatial  modulation 
profile  of  the  mobile  charge  density  n(x).  Analytical  expressions  for  the 
modulation  depth  of  the  mobile  charge  density  can  be  readily  derived  from 
the same analysis that led to Eqs. (3.21) and (3.22) (Young et al., 1974;
Moharam et al., 1979), for times sufficiently long compared with the mobile
carrier lifetime so that the mobile charge density represents a
quasi-steady-state distribution, and also sufficiently short so as to remain in
the initial recording regime. In the diffusion-only case, the mobile charge
density is

$$n(x) = \tau g_0\left[1 + m_n(K_G L_D)\cos(K_G x)\right], \tag{3.24}$$

in which $\tau$ is the mobile carrier lifetime and in which the modulation depth
$m_n(K_G L_D)$ is given by

$$m_n(K_G L_D) = \frac{m}{1 + K_G^2 L_D^2}. \tag{3.25}$$

Note that this charge distribution has the least efficient possible phase for
building up a space charge field. Recall from the discussion in Section 3.2.1
that the optimum phase of the mobile charge profile is a 180° phase shift
with respect to the incident illumination profile, allowing the space charge
field contribution of the mobile charge after subsequent trapping to add to
the contribution of the excess positively charged trap profile produced by
the photogeneration process. Here, however, the mobile charge profile is
aligned coincident with the illumination and with the excess positively charged
trap profiles. As a consequence, when a mobile charge is captured, it
removes one of the incremental photogenerated traps, thereby reducing the
overall space charge profile. A net space charge field can then accrue in the
diffusion-dominated transport case only by partially randomizing the mobile
charge distribution, corresponding to a reduction in the modulation depth
$m_n(K_G L_D)$. Note in Fig. 3.14 that the rise in charge transport efficiency with
increasing transport length $K_G L_D$ is coincident with a reduction of the mobile
charge modulation depth, as expected. Optimum charge transport efficiency
occurs when the mobile charge profile has been completely randomized,
which, according to Eq. (3.25), occurs in the limit $L_D \gg \Lambda_G$.

Fig. 3.14. Electron density modulation depth and charge transport efficiency as
a function of the diffusion-induced transport length $L_D$ (scaled to the grating
period $\Lambda_G$), in the diffusion-only regime.

Fig. 3.15. Electron density modulation depth and charge transport efficiency as
a function of the drift-induced transport length $L_E$ (scaled to the grating
period $\Lambda_G$), in the drift-only regime. The resultant phase shift of the
mobile electron distribution is shown at the top of the figure.

Similarly, for drift-dominated transport, the mobile charge profile has a
modulation depth of

$$m_n(K_G L_E) = \frac{m}{\left(1 + K_G^2 L_E^2\right)^{1/2}} \tag{3.26}$$

and a phase shift of $\phi$, as given by Eq. (3.23). The charge transport
efficiency, mobile charge modulation depth, and phase shift of the mobile charge
profile with respect to the incident illumination are plotted in Fig. 3.15 as
a function of the transport length $K_G L_E$ for drift-dominated transport, which
is in turn proportional to the applied bias field $E_0$. For short transport lengths,
the phase shift is close to 0°, which as pointed out above is the least efficient
phase for building a space charge field, and the modulation depth $m_n$ exhibits
its maximum value of $m$, the modulation depth of the illumination profile.
For longer transport lengths, the mobile charge profile phase shifts away
from 0°, but is always less than 90°, and hence always degrades the net
space charge profile. The modulation depth $m_n$ similarly decreases with
increasing charge transport length. The most efficient charge transport occurs
for very large transport lengths, $K_G L_E \gg 1$, for which the mobile charge profile
has become completely randomized.
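The link between transport efficiency and residual mobile charge modulation can be made explicit with a short worked relation. It assumes the transport factors $K_G^2 L_D^2/(1 + K_G^2 L_D^2)$ for diffusion and $K_G L_E (1 + K_G^2 L_E^2)^{-1/2}$ for drift, together with modulation depths $m/(1 + K_G^2 L_D^2)$ and $m/(1 + K_G^2 L_E^2)^{1/2}$. The symbols $\eta_D$ and $\eta_E$ are introduced here only for illustration and do not appear in the text.

$$\eta_D = \frac{K_G^2 L_D^2}{1 + K_G^2 L_D^2} = 1 - \frac{m_n(K_G L_D)}{m},$$

$$\eta_E = \frac{K_G L_E}{\left(1 + K_G^2 L_E^2\right)^{1/2}} = \left[1 - \left(\frac{m_n(K_G L_E)}{m}\right)^2\right]^{1/2} = \sin\phi, \qquad \frac{m_n(K_G L_E)}{m} = \cos\phi.$$

Under these assumptions, each efficiency reaches unity only as the corresponding modulation depth falls to zero, and in the drift case the efficiency and the phase shift are two views of the same quantity, with $\tan\phi = K_G L_E$.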

The reason that the phase of the mobile charge profile never exceeds a
90° phase shift for drift-dominated transport (and that the phase is always
0° for diffusion-dominated transport) can perhaps best be appreciated from
Fig. 3.13, which shows the mobile charge profile that is generated in
response to a narrow slit (i.e., very tightly focused) illumination profile. The
profiles shown in Fig. 3.13 can be derived by Fourier decomposing the Dirac
delta function of the illumination profile into an equivalent set of spatial
harmonics, applying Eqs. (3.25) and (3.26) to find the mobile charge profile
in response to each spatial frequency, and then performing an inverse Fourier
transform. But the profiles shown in Fig. 3.13 are also intuitively
reasonable. Diffusion tends to broaden the mobile charge density symmetrically
about the location of an illumination region, whereas drift tends to pull the
mobile charges to one side. In addition, note that the mobile charge
distribution resulting from any arbitrary illumination profile can be derived by
convolving the illumination profile with the appropriate distribution shown
in Fig. 3.13 (which can be considered to be a blur function). Because of
the monotonically decreasing shape of these blur functions, no phase shift
in excess of 90° is feasible.
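The blur function argument can be checked numerically. The sketch below assumes specific kernel shapes that the text does not give explicitly: a symmetric two-sided exponential for diffusion and a one-sided decaying exponential for drift. These shapes are chosen because their Fourier components reproduce modulation depths of the form $m/(1 + K^2 L^2)$ and $m/(1 + K^2 L^2)^{1/2}$. The helper `kernel_harmonic` and the values of `L` and `K` are hypothetical.

```python
import cmath
import math

def kernel_harmonic(kernel, k, x_min, x_max, n=100_000):
    """Midpoint-rule estimate of the integral of kernel(x) * exp(-i*k*x) dx."""
    dx = (x_max - x_min) / n
    total = 0j
    for i in range(n):
        x = x_min + (i + 0.5) * dx
        total += kernel(x) * cmath.exp(-1j * k * x)
    return total * dx

L = 1.0   # transport length, arbitrary units (assumed)
K = 2.0   # grating wave vector in units of 1/L, so K*L = 2 (assumed)

def diffusion_blur(x):
    """Assumed diffusion-only blur: symmetric exponential with unit area."""
    return math.exp(-abs(x) / L) / (2.0 * L)

def drift_blur(x):
    """Assumed drift-only blur: one-sided exponential with unit area."""
    return math.exp(-x / L) / L if x >= 0.0 else 0.0

h_diff = kernel_harmonic(diffusion_blur, K, -40.0 * L, 40.0 * L)
h_drift = kernel_harmonic(drift_blur, K, 0.0, 40.0 * L)

print("diffusion: relative modulation", abs(h_diff),
      "phase (deg)", math.degrees(cmath.phase(h_diff)))
print("drift:     relative modulation", abs(h_drift),
      "phase (deg)", math.degrees(abs(cmath.phase(h_drift))))
```

Because convolving a sinusoid with a kernel multiplies it by the kernel's Fourier component at that spatial frequency, the magnitude gives the modulation depth relative to $m$ and the argument gives the phase shift. For these assumed kernels the diffusion phase stays at zero and the drift phase equals $\arctan(K L)$, which remains below 90° for any finite transport length.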

Thus  we  find  that  the  baseline  sinusoid  model  represents  an  upper  bound 
on  the  grating  recording  efficiency  predicted  by  the  single  mobile  charge 
species/single donor/single trap photorefractive recording model.

3.3.2.  Temporal  Approach  to  Steady  State 

In addition to the charge transport efficiency factor just discussed, two
additional factors must be considered when evaluating any realistic recording
situation. One factor pertains to the saturation in temporal growth of the
space charge field, resulting in a reduced growth rate as the field approaches
its steady state limit. The second factor pertains to the modulation depth $m$
of the illumination profile, which is often chosen to be small to enable
enhanced recording techniques, as discussed in the next section.

The reduction in sensitivity of the recording process as the space charge
field approaches its steady state limit can be assessed from analytic solutions
that have been derived by Kukhtarev (1976). Typical solutions are shown
in Fig. 3.16 for a variety of bias fields, $E_0$, scaled to the maximum possible
field, $E_q$, due to limited ionized trap density (discussed in Section 3.2.2).
These curves were generated based upon typical charge mobility-lifetime
product ($\mu\tau$) parameters for bismuth silicon oxide (Bi$_{12}$SiO$_{20}$; BSO), as given
in Table 3.3. (A bias field to saturation field ratio of 10:1 is unphysical for
several photorefractive materials such as bismuth silicon oxide, requiring
exceptionally high bias fields and/or spatial frequencies, but is nonetheless
included for generality.) Note that in all cases shown the time needed to
reach saturation is approximately a factor of two longer than that predicted
by the baseline sinusoid idealized transport model.

Table 3.3
Material Parameters Assumed in the Calculations

                                        Bi12SiO20      BaTiO3              Units
Mobility-lifetime product $\mu\tau$     15             0.5                 μm²/V
Trap density                            1 × 10^16      2 × 10^16           cm^-3
Dielectric constant $\epsilon$          56             168 ($\epsilon_{33}$)
Index of refraction $n_0$               2.5            2.4
Electrooptic coefficient $r_{ij}$       5 ($r_{41}$)   80 ($r_{33}$)       pm/V
$n_0^3 r_{ij}/\epsilon$                 1.4            6.8                 pm/V

(After Valley and Klein, 1983, and references cited therein)

Because the analytical solutions for temporal evolution derived
by Kukhtarev (1976) are valid only for small modulation depths $m$ of the
illumination profile, such solutions describe low recording efficiency
situations in which the majority of the photons in the illumination beam
contribute a uniform photocurrent and only a small fraction of the photons
convey the spatial structure in the image profile. This point is emphasized in
Fig. 3.17, in which numerical solutions of the photorefractive equations are
presented, showing the temporal evolution of the first spatial harmonic
component of the space charge field for various modulation depths $m$. These
solutions have been produced by the authors using numerical techniques
discussed by Moharam et al. (1979) but applied to the full set of photorefractive
equations proposed by Kukhtarev (1976).

In Fig. 3.17, a crystal of bismuth silicon oxide illuminated by a 300 cycle/mm
sinusoid has been assumed. Note in this figure the slight oscillations
that can be observed in the temporal evolution. The strength of these
oscillations is directly dependent on the charge mobility-lifetime product, $\mu\tau$;
the curves in Fig. 3.17 correspond to the material and configuration
parameters as listed in Table 3.3. It should be noted that a wide variation in
mobility-lifetime products has been reported, even for crystals with nominally
the same composition (Lesaux et al., 1986).

Note  also  in  Fig.  3.17  that  the  highest  space  charge  fields  are  associated 
with  the  highest  illumination  profile  modulation  depths,  a  regime  for  which 
the  time-dependent  analytical  solutions  derived  by  Kukhtarev  are  no  longer 
applicable. Furthermore, to first order the resultant space charge field $E_1$,
both in the initial recording regime as well as in saturation, is directly
proportional to the modulation depth parameter $m$.

Fig. 3.17. Increase in saturation space charge field with increasing modulation
depth for Bi$_{12}$SiO$_{20}$, showing transition from linear to nonlinear recording
regimes. (Horizontal axis: grating recording time, scaled to the dielectric
relaxation time.)

3.3.3.  Enhanced  Recording  Techniques 

The recording configuration just considered assumes a stationary
illumination profile and a constant applied bias field (for brevity, hereinafter called
the stationary illumination technique). Space charge fields with much higher
steady state limits can be obtained by either of two alternative nonstationary
recording configurations. One technique is to translate the illumination
profile with respect to the photorefractive crystal (hereinafter referred to as the
running grating technique) (Huignard and Marrakchi, 1981; Stepanov et al.,
1982; Valley, 1984; Refregier et al., 1985), and the second technique is to
periodically reverse the direction of the applied bias field (the alternating
field technique) (Stepanov and Petrov, 1985). These techniques do not
improve the rate at which space charge builds up with time under constant
illumination intensity, compared with the stationary illumination/constant
field recording configuration.

The running grating technique, in particular, was chosen for study herein,
both to illustrate enhanced photorefractive recording concepts and because
it  is  commonly  employed  to  provide  significant  amplification  of  weak  im¬ 
ages.  This  technique  can  be  studied  by  a  combination  of  analytical  solutions 
that  are  valid  in  the  linearized  regime  of  very  small  modulation  depths  and 
by  numerical  solutions  for  larger  image  modulations. 

The relative advantage of the running grating technique is summarized in
Fig. 3.18, which has been derived from the analytical solutions of Valley
(1984) and Refregier et al. (1985) in the limit of small illumination profile
modulation depths. A grating spatial frequency of 110 cycles/mm and
parameters typical of bismuth silicon oxide have been assumed in the solutions
shown in Fig. 3.18. The spatial frequency is chosen to be 110 cycles/mm,
rather than 300 cycles/mm, because the enhancement of the steady state
space charge field is maximized for this grating frequency, assuming
material parameters for Bi$_{12}$SiO$_{20}$ as given in Table 3.3. Note that the steady
state limit of the space charge field is in fact increased by the running grating
technique relative to that obtained with a stationary grating, but that the
initial recording sensitivities are asymptotically equal. The initial growth of
the space charge field continues to be bounded by a combination of quantum
limitations and charge transport inefficiencies, as described above.
Furthermore, the additional factor of two reduction in sensitivity observed on
approach to saturation obtains for both solutions.

Fig. 3.18. Temporal evolution of the space charge field, comparing the cases of
stationary and running gratings for Bi$_{12}$SiO$_{20}$ in the low modulation limit.
(Horizontal axis: grating recording time, scaled to the dielectric relaxation
time.)

The magnitude of the steady state space charge field is necessarily quite
small in the linearized regime for which the analytical solutions apply, as
the field is proportional to the modulation depth $m$ of the illumination
profile, which must be kept small to ensure accurate analytical solutions. Larger
space charge fields require larger illumination modulation depths, which in
turn result in eventual nonlinear saturation of the space charge field itself.
Numerical solutions have been generated by the authors to explore the onset
of this field saturation, with typical results as shown in Fig. 3.19, again
assuming the nominal Bi$_{12}$SiO$_{20}$ material parameters listed in Table 3.3. Note
that the saturation of the steady state field occurs at quite modest values of
the modulation depth, which is consistent with the experimental observations
reported by Refregier et al. (1985). Thus the running grating technique proves
to be most effective for amplifying weak images, but not for recording large
space charge fields. For recording the largest fields, stationary illumination
is preferred.

Fig. 3.19. Steady state space charge field as a function of modulation depth for
Bi$_{12}$SiO$_{20}$, comparing the cases of stationary and running gratings.

4. Representative Grating Recording Efficiency Calculations

In order to illustrate the above concepts, we proceed in this section to
consider the several factors that contribute to the overall grating recording
efficiency for two different types of materials and for two different types of
applications. The materials considered are bismuth silicon oxide (Bi$_{12}$SiO$_{20}$,
or BSO) and barium titanate (BaTiO$_3$); the principal material parameters
assumed in the estimates are listed in Table 3.3, based upon a set of values
utilized in previous related analyses by Valley and Klein (1983). The two
applications considered are those of reconfigurable holographic
interconnections and the amplification of weak images.

Let us first consider a reconfigurable holographic interconnection
implemented in bismuth silicon oxide. For simplicity, we assume that only one
grating with a spatial frequency of 300 cycles/mm is recorded in the crystal.
To achieve maximum diffraction efficiency, the modulation depth of the
recording beams should be as large as possible; ideally, $m = 1$. Also, a bias
electric field is typically applied to crystals of Bi$_{12}$SiO$_{20}$ to enhance the
photosensitivity; a field of 6 kV/cm is typical, implying a drift transport length
$L_E$ from Eq. (3.18) of 9 μm, assuming the mobility-lifetime product given
in Table 3.3. For a 300 cycle/mm grating frequency, this implies an
($L_E/\Lambda_G$) ratio of about 2.7, and from Fig. 3.15 we see that this corresponds to
essentially 100% charge transport efficiency in the initial stages of recording.

The recording efficiency for a Bi$_{12}$SiO$_{20}$ interconnect, compared with the
ideal (i.e., bipolar comb) quantum efficiency limit, is listed in Table 3.4.
Three factors are considered in this and subsequent tables. The first is the
efficiency factor that applies between the baseline sinusoid and bipolar
comb cases, a factor that is common to all recording configurations
considered in this section. The second factor is the charge transport efficiency,
comparing actual recording performance to that of the baseline sinusoid case.
The Bi$_{12}$SiO$_{20}$ interconnect is assigned a 50% charge transport efficiency to
account for the factor of two increase in recording energy (photon flux)
estimated to reach saturation, as shown in Fig. 3.16 and as discussed in
Section 3.3.2. The final factor is the modulation depth, which we have
assumed to be unity for a reconfigurable interconnect. Thus the total grating
recording efficiency, in terms of the magnitude of the space charge field
generated per unit photon flux, compared with ideal quantum efficient
recording, is only about 12.5% for a Bi$_{12}$SiO$_{20}$ interconnection. This gives a
diffraction efficiency derating factor of only 1.6% (assuming a diffraction
efficiency that is proportional to the square of the space charge field).

Table 3.4
Representative Grating Recording Efficiency Calculation: Reconfigurable
Interconnection, Drift Recording in Bismuth Silicon Oxide

Modulation depth = 1.0
300 cycles/mm grating frequency

Space charge efficiency factors:
  Baseline sinusoid/bipolar comb            0.25
  Charge transport (to saturation)          0.5
  Modulation depth factor                   1.0
Grating recording efficiency                0.125
Diffraction efficiency derating factor      0.016

As a second example, consider image amplification in Bi$_{12}$SiO$_{20}$ using a
running grating enhanced recording technique. We will again assume a 110
cycle/mm grating spatial frequency (see Section 3.3.3) and a bias field of
6 kV/cm applied to the Bi$_{12}$SiO$_{20}$ crystal. However, the modulation depth
must be reduced from 100% to approximately 10% to achieve the peak space
charge enhancement in saturation provided by the running grating recording
technique, as shown in Fig. 3.19 and as discussed in Section 3.3.3. For our
calculations, we chose a modulation depth equal to 10%, implying a
corresponding reduction in the quantum efficiency of the recording process;
i.e., most of the photons must supply the pump beam, with comparatively
few photons in the signal beam containing signal information. Thus the total
quantum efficiency for image amplification, shown in Table 3.5, is an order
of magnitude lower than that given in Table 3.4 for the reconfigurable
interconnection. Correspondingly, the diffraction efficiency derating factor is
two orders of magnitude lower for this case than for the case of the
reconfigurable interconnection and nearly four orders of magnitude less efficient
than the absolute quantum limitation.

Table 3.5
Representative Grating Recording Efficiency Calculation: Two-Wave Image
Amplification, Running Gratings in Bismuth Silicon Oxide

Modulation depth = 0.1
110 cycles/mm grating frequency

Space charge efficiency factors:
  Baseline sinusoid/bipolar comb            0.25
  Charge transport (to saturation)          0.5
  Modulation depth factor                   0.1
Grating recording efficiency                0.0125
Diffraction efficiency derating factor      0.00016
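The arithmetic behind the two bismuth silicon oxide examples can be reproduced in a few lines. The sketch below multiplies the three efficiency factors quoted in the text (0.25 for baseline sinusoid versus bipolar comb, 0.5 for charge transport to saturation, and the modulation depth) and squares the result to obtain the diffraction efficiency derating factor. The function name `recording_budget` is illustrative only.

```python
def recording_budget(baseline, transport, modulation):
    """Return (grating recording efficiency, diffraction derating factor).

    The derating factor assumes diffraction efficiency proportional to the
    square of the space charge field, as stated in the text.
    """
    grating = baseline * transport * modulation
    return grating, grating ** 2

# Reconfigurable interconnection, drift recording, m = 1.0
interconnect = recording_budget(0.25, 0.5, 1.0)
# Two-wave image amplification with running gratings, m = 0.1
amplifier = recording_budget(0.25, 0.5, 0.1)

print("interconnect:", interconnect)
print("amplifier:   ", amplifier)
print("derating ratio:", interconnect[1] / amplifier[1])
```

The factor of ten in modulation depth carries through linearly to the grating recording efficiency and quadratically to the derating factor, which is why the two cases differ by one and two orders of magnitude respectively.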

As a final example, consider a reconfigurable interconnection in barium
titanate, with grating recording efficiency as shown in Table 3.6 for the case
of recording by diffusion transport only. Because of the large electrooptic
coefficient in barium titanate, only very modest space charge fields,
typically a fraction of the diffusion field for a 300 cycle/mm grating, are needed
to achieve peak diffraction efficiencies in crystals of reasonable size. Hence
diffusion transport often proves to be sufficient in this material in order to
generate experimentally useful diffraction efficiencies. Unfortunately, two
factors work against the high electrooptic coefficient in barium titanate. One
is the concomitantly high dielectric constant, which from Maxwell's first
equation implies that a considerable amount of space charge must be moved
to achieve a modest space charge field. As mentioned in the introduction,
the combined material parameter $n_0^3 r/\epsilon$, which is a measure of the amount
of optical index modulation per unit space charge, proves to be surprisingly
constant from material to material (Glass et al., 1984; Glass, 1984). Based
upon this measure alone, barium titanate proves to be modestly superior to
bismuth silicon oxide, as shown in Table 3.3.

The second, and far more serious, factor degrading grating recording
efficiency for diffusion recording in barium titanate is its low mobility-lifetime
product $\mu\tau$, which is almost two orders of magnitude lower than that for
bismuth silicon oxide, implying a significantly degraded charge transport
efficiency. The diffusion transport length $L_D$ predicted by Eq. (3.19) is of
order 35 nm, based upon the mobility-lifetime product $\mu\tau$ given in Table
3.3. Assuming a 300 cycle/mm grating, this implies by Eq. (3.21) a charge
transport efficiency factor of order 0.004. We include an additional factor
of 0.5 in Table 3.6 to account for the additional photon flux required to
reach saturation, as indicated in Fig. 3.16 and by the discussion in Section
3.3.2. Thus we find that the grating recording efficiency for diffusion
recording in barium titanate is some 250 times less than that for drift recording
in bismuth silicon oxide, in terms of space charge generated per unit incident
photon.

Table 3.6
Representative Grating Recording Efficiency Calculation: Reconfigurable
Interconnect, Diffusion Recording in Barium Titanate

Modulation depth = 1.0
300 cycles/mm grating frequency

Space charge efficiency factors:
  Baseline sinusoid/bipolar comb            0.25
  Charge transport (to saturation)          0.002
  Modulation depth factor                   1.0
Grating recording efficiency                0.0005
Diffraction efficiency derating factor      2.5 × 10^-7
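The chain of estimates for barium titanate can be traced step by step. The sketch below starts from the 35 nm diffusion length and 300 cycle/mm grating frequency quoted in the text, and assumes that the diffusion transport factor has the form $K_G^2 L_D^2/(1 + K_G^2 L_D^2)$. The variable names are illustrative, and the comparison value of 0.125 is the grating recording efficiency quoted for drift recording in bismuth silicon oxide.

```python
import math

spatial_frequency = 300e3   # 300 cycles/mm expressed in cycles per metre
l_d = 35e-9                 # diffusion transport length quoted in the text, m

k_g = 2.0 * math.pi * spatial_frequency   # grating wave vector, rad/m
kl = k_g * l_d                            # dimensionless product K_G * L_D

# Assumed diffusion-only transport factor during initial recording.
initial_transport = kl ** 2 / (1.0 + kl ** 2)
# Additional factor of 0.5 for the photon flux needed to reach saturation.
transport_to_saturation = 0.5 * initial_transport
# Baseline sinusoid/bipolar comb factor of 0.25, modulation depth of 1.0.
grating_efficiency = 0.25 * transport_to_saturation * 1.0

bso_drift_efficiency = 0.125
ratio = bso_drift_efficiency / grating_efficiency

print(f"K_G * L_D           = {kl:.4f}")
print(f"initial transport   = {initial_transport:.4f}")
print(f"grating efficiency  = {grating_efficiency:.5f}")
print(f"BSO / BaTiO3 ratio  = {ratio:.0f}")
```

Without intermediate rounding the ratio comes out somewhat below the quoted figure of 250, which follows from rounding the transport factor to 0.002 before multiplying; the order of magnitude is the same.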

Thus we have considered representative examples of several different
materials, transport mechanisms and efficiencies, and recording applications.
We  find  that,  even  in  the  most  efficient  grating  recording  configurations,  a 
significant  inefficiency  still  exists  between  the  absolute  quantum  limits  and 
actual  performance. 


5. Conclusions

In this chapter, we have considered a number of the fundamental physical
limitations that constrain the potential performance of photorefractive
materials. In particular, we have described several idealized photogeneration
and charge transport models in terms of the grating recording efficiency,
and we have identified one such model (the bipolar comb) as the absolute
quantum limit against which other such models may be compared. The bipolar
comb model generates the maximum possible fundamental harmonic
of the space charge field at unity dielectric constant for a given number of
photoexcited mobile charge carriers. A second idealized model, the baseline
sinusoid, provides an upper bound for the grating recording efficiency of a
more realistic photorefractive grating recording model involving a single mobile
charge carrier and a single donor/single trap photorefractive center. This
upper bound was shown to be a factor of four less efficient in generating a
given space charge field than the quantum limitations imply for an optimum
photogeneration distribution and perfectly efficient charge transport. An
additional factor of two accrues from the nonlinearity of the grating recording
process observed as the space charge field nears saturation. The combination
of absorption and reflection losses in typical photorefractive recording
configurations (without antireflection coatings) contributes approximately one
order of magnitude to the inefficiency of grating recording and readout
relative to the incident photon flux. Hence, the usual photorefractive recording
configuration exhibits an overall sensitivity that is approximately two orders
of magnitude less than that achievable in the quantum limit. This results in
roughly four orders of magnitude reduction in the corresponding diffraction
efficiency per incident photon. These considerations explain to a certain
degree why photorefractive recording has proven to be relatively insensitive
as compared with distinct but related mechanisms of spatial light modulation.

Perhaps far more important, however, are the implications of the above
analysis for the conceptual design and technological implementation of
optimized grating recording media that operate far closer to the quantum limits.
For example, the bipolar comb illumination profile is not necessarily as
unphysical as it may at first seem, and it can be closely approximated by the
utilization of stratified volume holographic optical elements (Johnson and
Tanguay, 1988) for beam formation. This approach is perhaps most
appropriate for the generation of a grating of given spatial frequency, as in the
photorefractive incoherent-to-coherent optical converter (Marrakchi et al.,
1985). As a second example, the factor of two inherent in the approach to
saturation can be avoided by seeking photorefractive materials of near unity
charge transport efficiency and high electrooptic figures of merit, such that
experimentally suitable diffraction efficiencies can be obtained well within
the linear recording regime. As a final example, the inefficiency implied by
surface reflection losses can be dramatically reduced, even over the broad
spectrum of angles and wavelengths characteristic of photorefractive
recording, by the design and incorporation of appropriate antireflection
coatings (Karim et al., 1988).
CHAPTER 15


PHOTONIC  IMPLEMENTATIONS  OF 
NEURAL  NETWORKS 


B.  Keith  Jenkins  and  Armand  R.  Tanguay,  Jr. 


TOWARDS THE DEVELOPMENT OF A
NEURAL NETWORK IMPLEMENTATION TECHNOLOGY


As described in other chapters of this book, neural networks provide a different
approach to solving problems as compared with more conventional algorithmic techniques,
and can be applied to a wide range of applications. In some of these application domains
the simulation performance of neural networks has been comparable to that of more
conventional algorithmic approaches. In many application domains, however, a realistic (and
therefore large scale) problem may overwhelm the conventional approach, in that it may
be too computation intensive to be implemented on a sequential digital computer, and
may not parallelize sufficiently well (if at all) for efficient computation on a parallel digital
machine. On the other hand, because a neural network algorithm is inherently parallel, it
immediately suggests a parallel architecture, which may in turn be implemented using
either analog or digital hardware. And for the case of large-scale problems, analog hardware
will typically provide a much more efficient neural implementation than digital hardware.

In the previous chapter, fully electronic (primarily VLSI-based) neural networks were
described in which the primary functionality of both the neuron units and the weighted
(synaptic) interconnection matrix is incorporated on a planar microelectronic chip. An
important advantage of the integrated circuit approach to neural network implementation
is the capability for near-term technology insertion, with leverage provided by a
well-established technology base characterized by a fully developed computer aided design and
computer aided manufacturing (CAD/CAM) device and circuit repertoire. An equally
important limitation is the difficulty in scaling up neural chips to incorporate large numbers
of neuron units in fully (or near fully) interconnected architectures. This limitation derives
from the limited pin-out, off-chip communication bandwidth, and on-chip interconnection
density available in both current generation and projected chip designs.

In this chapter, we consider the utilization of optical (free-space) interconnection
techniques in conjunction with photonic switching and modulating devices to expand the
number of neuron units and complexity of interconnection, by using the off-chip (3rd) dimension
for synaptic communication. As we shall see, the merging of optical and photonic devices
with appropriately matched electronic circuitry can provide novel features such as
fully parallel weight updates and modular scalability, as well as both short and long term
synaptic plasticity.

Many approaches to the incorporation of photonic and optical technology in the
implementation of neural networks are currently being pursued in the research community.
The intent of this chapter is not to present a review of these various approaches, the
details of which can be obtained from several of the references given in the Suggested
Further Reading section at the end of this chapter. Instead, our focus herein is directed
toward a description of key photonic devices and techniques based on fundamental optical
phenomena, as well as toward a unique and generalizable approach to their potential use
in the implementation of large-scale, highly parallel neural network architectures. The
unusual nature of some of these techniques has interesting implications for the design of
naturally mapped architectures and associated learning/computing algorithms. We will
address a number of these unique features in the context of a description of the basic
optical phenomena that can be used to advantage, and of the array of photonic devices that
comprise the system designer’s palette. Because the incorporation of optical and photonic
hardware casts the subject of neural network implementations in a somewhat unfamiliar
light, we first discuss a set of desirable and requisite characteristics for neural network
implementation technologies.

An important feature of any implementation technology is that of generality; a
“building block” approach. The growth and synthesis of material structures, and their
incorporation into devices, must be well characterized, understood, and repeatable, for a small
number of specific material combinations and device structures. These devices are then
assembled into circuits or architectures for the implementation of specific computational
models. This provides leverage in two ways: (1) the small number of useful and well
understood components are used repeatedly in different structures for different applications,
and (2) architecture and system level designers need not be experts in the properties of the
material and device structures used to configure the components, saving many man-hours
in the design of computational systems over a completely custom approach. Such a purely
custom approach could preclude the widespread use of these architectures and systems, as
has been characteristic, for example, of optical information processing and optical signal
processing systems over the past two decades.

Not only is it important for the implementation technology to be of a building block
nature, but it is also important for the models underlying the computational architectures
to support such an approach, and in fact to be of a building block nature themselves.
Ideally the models would comprise a set of common components and operations at the
functional level, such as specific types of neuron units, weighted interconnections, weight
updates, and comparisons with desired target values. Then the mapping from model and
functional architecture to hardware architecture, layout, and implementation can proceed
efficiently as well. This building block approach at both the model and hardware levels
has certainly been characteristic of the development of digital electronics, and has been
largely responsible for its success.

Assuming that appropriate neural network models and a corresponding technology
base can be merged within a compatible building block approach, neural network
systems potentially provide a unique capability for large-scale analog, nonlinear computation.
As such, the neural network paradigm potentially alleviates two critical bottlenecks that
have impeded the widespread implementation of large-scale analog, nonlinear computing
systems based on non-neural architectures: the lack of appropriate generic hardware
components and of the sufficiently leveraged manpower required for their economical design
and manufacture, as well as difficulty in establishing efficient techniques for mapping from
the application and model domain onto compatible hardware. With regard to the
former bottleneck, neural network architectures are generally forgiving with respect to device
nonuniformities and imperfections, creating much needed latitude for the device designer.
With regard to the latter bottleneck, the neural network paradigm inherently provides a
mapping from the problem domain onto a highly parallel architecture, which immediately
yields a starting point for its layout in analog hardware.

In the case of photonics for neural network applications, the hardware technology is
being developed simultaneously with the neural computation model(s). This implies at
least two things. First, it is crucial to retain flexibility in the functionality of each
component, so that as the neural computation models evolve, the hardware can evolve along
with them. The development of an entirely new technology base typically takes at least a
decade; such a delay between model development and hardware realization is generally
unacceptable. Thus, the generic technology base must provide sufficient flexibility. Second,
not only should the neural computation model steer the technology development, but the
reverse can, and indeed must, also occur. This assures a mutual compatibility in outcome.

The basic requisite functions for a neural network technology base appear to be:
neuron unit response, weighted interconnections (fixed and variable), input/output, learning
computation and weight update, and duplication capability (i.e., the capability of making
a copy of a network structure). In addition, other features are desirable, such as higher
order connection capability. The neuron unit response, at the most common and basic
level, is a sum-of-inputs followed by a monotonic nonlinearity. The nonlinearity should
have the flexibility of providing different amounts of gain; for example, it is useful to have
a high gain to implement a binary threshold, and a low gain to implement a nearly linear
response. The neuron unit should be bipolar in that it permits both positive and negative
inputs, so that inhibitory and excitatory connections can be realized. This we consider
to be the minimal requisite functionality of a neuron unit. In addition, a very desirable
feature is the capability for bipolar outputs. Note that biologically this is not necessarily
the case, but useful neural computation models will likely deviate substantially from
biological reality and may require this capability. Other capabilities are also desirable, such
as leaky integrator effects [Mead, 1989] and more complex behavior such as that required
in shunting networks [Carpenter, 1987].
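
As a concrete illustration of the minimal neuron unit response just described, the following Python sketch sums excitatory and inhibitory inputs and applies a monotonic nonlinearity with adjustable gain. The choice of a hyperbolic tangent (which also yields bipolar outputs) and all names are assumptions made for illustration; the text does not prescribe a particular nonlinearity.

```python
import math

def neuron_unit(excitatory, inhibitory, gain=1.0):
    """Sum of inputs followed by a monotonic nonlinearity (tanh, assumed here).

    excitatory, inhibitory: non-negative input signal levels.
    gain: large values approach a binary threshold, small values a linear response.
    """
    activation = sum(excitatory) - sum(inhibitory)
    return math.tanh(gain * activation)

# Net activation of +0.5 in both cases below.
binary_like = neuron_unit([0.8], [0.3], gain=100.0)   # saturates close to +1
linear_like = neuron_unit([0.8], [0.3], gain=0.01)    # close to 0.01 * 0.5

# Inhibition dominating gives a negative (bipolar) output.
inhibited = neuron_unit([0.3], [0.8], gain=100.0)     # saturates close to -1
```

A single gain parameter spans both regimes mentioned above, which is one simple way to retain the flexibility called for in the text.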

Each  weighted  interconnection  must  store  a  learned  or  initialized  value,  and  perform 
a  multiplication  operation  on  the  signals  passing  through.  An  analog  multiplication  is 
generally  much  more  efficient  than  a  digital  one  for  reasons  of  speed  and  device  area 
or  volume.  For  this  reason,  it  is  worth  some  effort  to  provide  analog  storage  for  the 
weights.  Note  that  the  very  large  number  of  weights  used  in  many  networks  implies 
that  minimization  of  the  incorporated  hardware  complexity  of  the  requisite  storage  and 
multiplication  operations  is  crucial  for  physical  realizability.  Input/output  is  often  ignored 
at  the  higher  levels,  but  can  critically  affect  the  physical  architecture  and  can  be  a  major 
factor  in  determining  the  overall  throughput.  The  input/output  function  includes  the 
input  of  signals,  the  output  of  results,  and  the  input  of  weights  if  necessary. 

It  is  important  at  the  outset  to  distinguish  among  systems  that  have  fixed  weights; 
systems  that  have  programmable  weights  (that  are  externally  computed  but  loaded  into 
the  network);  and  systems  that  have  full  learning  capability,  in  which  the  learning  algorithm 
is  implemented  as  part  of  the  parallel  system.  The  associated  hardware  complexity  can 
be  quite  different  in  each  of  these  cases.  Finally,  duplication  capability  is  useful,  for 
example, for replicating a pre-learned network when multiple copies of the network are to
be  produced  and  subsequently  used  with  fixed  connections.  Probing  the  weight  values  in  a 
hardware  implementation  may  at  first  sound  straightforward,  but  implementing  very  large 
numbers  of  weights  in  a  small  volume  can  in  some  cases  preclude  such  capability. 

Applications of hardware implementations of neural networks could include sensor
signal processing and fusion, pattern recognition, associative memory, and robotic control.
These applications imply a wide range of hardware requirements. For example, most
vision processing is characterized by moderately large numbers of neuron units, with small
to moderate connectivity and primarily local interconnections. Associative memory, on
the other hand, typically requires a very high connectivity.

Semiconductor-based VLSI technology has proven to be very capable for the
implementation of most, if not all, of the above functions. It also has the capability for integration of
control circuitry and/or arbitrary digital or logical operations on the same chip as the
neural processing circuitry. However, an important issue in neural implementations is that of
scalability, since many neural network applications are likely to require very large numbers
of neuron units and connections. As pointed out above, it is primarily the consideration
of scalability that leads us to the conclusion that purely electronic VLSI will work well for
certain applications, but can benefit greatly from the incorporation of photonics for other
applications.

Two  important  considerations  in  VLSI  implementations  are  area  complexity  and  pinout 
requirements. A fully connected network of N neuron units requires area O(N^2). One
can  think  of  this  as  having  only  a  single  linear  dimension  available  for  the  neuron  units 
themselves,  in  order  to  leave  room  for  the  connections.  This  of  course  limits  the  size 
of  fully  connected  networks  that  can  be  accommodated  on  a  single  chip.  For  example, 
chips have been fabricated with 54 neuron units and 2916 ternary (3-level) synapses, and
it is estimated that approximately 700 neurons, fully connected with (10^5 - 10^6) similar
synapses, could be implemented on a CMOS chip using 0.5 μm design rules. Learning
capability  with  analog  synapses  may  require  substantially  more  area  per  synapse  [Jackel, 
1988].  On  the  other  hand,  networks  with  low  connectivity  and  only  local  connections 
between  neuron  units  permit  a  much  larger  number  of  neuron  units  to  be  implemented  on 
a chip. For example, the silicon retina of Mead et al. comprises a 48 x 48 array of neuron-like
units, each connected to its 6 nearest neighbors on a hexagonal grid [Mead, 1988]. A
much  larger  number  of  neuron  units,  on  the  order  of  10®,  could  be  implemented  with  such 
a  locally-connected  array,  since  the  neuron  units  and  the  connections  each  require  area 
only O(N). So we see that a critical limiting factor in the VLSI implementation of neural
networks  is  the  area  required  for  on-chip  interconnections,  and  that  the  area  required  for 
neuron  units  is  relatively  inconsequential. 
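
The contrast between the two scaling laws can be checked with simple arithmetic. The short Python sketch below is illustrative only; the function names are hypothetical, and the count for the locally connected array ignores edge effects.

```python
def fully_connected_synapses(n_units):
    """Synapse count for N fully connected neuron units: grows as N^2."""
    return n_units * n_units

def locally_connected_links(n_units, neighbors):
    """Connection count when each unit reaches a fixed number of neighbors: grows as N."""
    return n_units * neighbors

fabricated = fully_connected_synapses(54)     # 2916, matching the fabricated chip cited above
projected = fully_connected_synapses(700)     # 490000 for the projected 700-neuron chip

retina_units = 48 * 48                        # 2304 neuron-like units
retina_links = locally_connected_links(retina_units, 6)   # 13824, ignoring edge effects
```

Roughly three times as many neuron units in the locally connected array require about thirty-five times fewer connections than the projected fully connected chip, which is the sense in which interconnection area, rather than neuron area, sets the limit.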

The  number  of  pinouts  that  can  be  provided  on  a  chip  is  proportional  to  its  linear 
dimension,  not  to  its  area.  This  degree  of  pinout  capacity  is  well  matched  to  fully  connected 
networks in which the weights do not need to be input or output frequently, and to locally
connected networks of low to moderate bandwidth. An example of the latter is a vision
network that operates at video frame rates; in this case the signal can be easily time
multiplexed  onto  a  relatively  small  number  of  lines  for  communication  onto  and  off  of  the 
chip. 

On the other hand, other application areas will require a higher input/output (I/O)
bandwidth and/or a large number of neuron units with high connectivity. For example,
if the weights are to be fed onto and off of a chip frequently, substantial multiplexing
would be required for reasonably large networks (e.g., up to 200 pinouts can generally
be accommodated, but as described above, as many as 10^5 - 10^6 synaptic weights can be
incorporated on the chip). Wafer scale integration with bump bonding techniques can
help by providing large, multi-wafer structures, but the number of I/O lines is still likely
to be modest due to practical and physical constraints. So we see that a second critical
limiting factor in the VLSI (and wafer scale integration) implementation of neural
networks is the number of I/O lines that can be practically incorporated. Neural applications
utilizing large fully connected networks such as associative memory would benefit greatly
from implementations of 10^5 - 10^6 fully connected neuron units, implying the need for
10^10 - 10^12 synaptic weights. In addition, the intermediate realm of large networks with
partial but moderate-to-large connectivity will likely also prove beneficial to a wide range
of applications.
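
To give a feel for the multiplexing burden, the following Python sketch estimates how many weights each pin must carry in sequence when a fully connected chip is loaded through a limited number of pins. It is a rough illustration only: the function name is hypothetical, and the 700-neuron and 200-pin figures are taken from the text as representative values.

```python
def weights_per_pin(n_units, n_pins):
    """Sequential weight transfers per pin to load all N^2 weights of a fully connected chip."""
    total_weights = n_units * n_units
    return -(-total_weights // n_pins)   # ceiling division

load_700 = weights_per_pin(700, 200)     # 2450 transfers per pin

# Doubling the chip's linear dimension doubles the pins (and, for a fully
# connected layout, the neuron units) but quadruples the number of weights.
load_1400 = weights_per_pin(1400, 400)   # 4900 transfers per pin
```

Because the pin count follows the linear dimension while the weight count follows the area, the per-pin load grows in proportion to the number of neuron units.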

In this chapter, we discuss a variety of issues that impact the development of
photonic technology as applied to hardware implementations of neural networks with enhanced
capabilities. Photonics has the potential for the implementation of networks with large
numbers of neurons (10^5 - 10^6) and high connectivity (approximately 10^10 analog-weighted
interconnections) in one “module”. The approach taken here is to use electronics to
implement the internal function of each neuron unit, and to use optics to implement the
connections, weights, and I/O. With this technique most of the area of a two-dimensional
(2-D) “chip” can be used to implement the neuron units themselves, and optical free-space
propagation and volume holograms can be used to implement the interconnections. Thus
the interconnections actually occupy a three-dimensional (3-D) volume, which improves
scalability dramatically.

The next section of this chapter describes the fundamental optical principles and key
photonic technology concepts that are needed for neural network implementations, and
covers photonic analog arithmetic, switching, interconnections, sources and detectors.
Architectural considerations are discussed in the subsequent section, including the use of
volume interconnections, signal representation, and desired architectural features. The
next section then presents a photonic implementation strategy that satisfies most of the
desired criteria. In the two concluding sections we investigate the ultimate limitations of
photonic implementations of neural networks, and consider the future of such
implementations.

FUNDAMENTAL  PRINCIPLES  OF 
PHOTONIC  TECHNOLOGY 

In order to effectively appreciate the potential advantages as well as the limitations of
extending the VLSI (electronic) repertoire to include photonic components and optically
inspired functionality, we will first identify and then explain a few truly fundamental
principles of the optical and photonic technologies on which such hybrid neural network
implementations are based. In this section, therefore, we discuss the basic features of
optical analog computation with both coherent and incoherent illumination sources, photonic
switching devices and their various neuron-like functions, the characteristics of photonic
interconnections that are essential to the implementation of synapse-like interneuron wiring,
and the principal features of sources (photonic power supplies) and detectors
(photonic-to-electronic power converters).


Optical  Analog  Computation 

At the present time, most proposed photonic implementations of neural networks are
based on analog operations, both in the representation of neuron outputs as well as in
the incorporation of interconnection weights. This emphasis of analog computation over
perhaps more familiar digital computation derives principally from several distinct
advantages that accrue to optical systems designed to handle the switching and interconnection
of very large numbers of inputs and outputs at each circuit node (neuron unit). These
advantages include a significant reduction in the number of switching components required to
sum multiple inputs [Abu-Mostafa, 1989], an increase in the degree of fan-in and fan-out
allowable from each circuit node, the elimination of analog-to-digital and digital-to-analog
converters, a significant decrease in signal routing and interconnection complexity, the
potential utilization of natural physical phenomena within certain photonic devices to
accomplish difficult computational functions directly, and the possibility of higher computational
throughput per unit dissipated energy in operations characterized by high computational
complexity. Additionally, two notable disadvantages of analog systems, error
accumulation and lack of precision in representation, may prove to be relatively unimportant in the
neural network environment, due in part to the self-organizing and error correcting nature
of many neural learning algorithms [von der Malsburg, 1987]. We will return to a number
of these issues throughout the remainder of this chapter.

The computational operations necessary for the implementation of a wide range of
neural and neural-like networks are surprisingly simple, consisting primarily of addition,
subtraction, multiplication, and nonlinear thresholding. The operation of addition usually
must be performed over a very large number of inputs at a given neuron unit, representing
weighted excitatory signals from other interconnected neuron units; subtraction is
utilized to differentiate excitatory inputs from inhibitory inputs to a given neuron unit, as is
common, for example, in models of the visual process and of associative memories.
Multiplication is necessary for the provision of linear interconnections with signal-independent
weights, which in turn store learned (or pre-programmed) information, and hence form an
important constituent of the neural paradigm. Finally, nonlinear thresholding operations
are performed in order to provide the appropriate transfer function between the neuron
activation potential (sum of all inputs, both positive and negative, to a given neuron unit),
and the output each neuron unit generates in response. It is this nonlinearity in
particular that gives multilayer neural networks their computational power, and allows recurrent
networks to iteratively approach one of several stable states of the system. The detailed
nature of the nonlinearity itself can affect a number of critical system properties,
including the number of iterations (or equivalently the time) required to achieve steady state,
and the stability of the network in the presence of noise, inaccuracy, and nonuniformities
among both the neuron units and the interconnections. In some envisioned neural
network implementations, it is important to also be able to alter the nature of the nonlinear
thresholding function dynamically during the computational phase of operation.
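
The way in which these four operations combine in a recurrent network can be illustrated with a small example. The Python sketch below is only one possible instance: it assumes a Hopfield-style network with bipolar states, a hard threshold, and weights formed from a single stored pattern. None of these choices is prescribed by the text, and all names are hypothetical.

```python
def threshold(activation):
    """Hard (binary) nonlinear threshold with bipolar output."""
    return 1 if activation >= 0 else -1

def iterate_to_steady_state(weights, state, max_sweeps=20):
    """Update units one at a time until a full sweep changes nothing."""
    state = list(state)
    for sweep in range(1, max_sweeps + 1):
        changed = False
        for i, row in enumerate(weights):
            products = [w * s for w, s in zip(row, state)]       # multiplication
            excitatory = sum(p for p in products if p > 0)       # addition
            inhibitory = sum(-p for p in products if p < 0)      # addition
            new_value = threshold(excitatory - inhibitory)       # subtraction, thresholding
            if new_value != state[i]:
                state[i] = new_value
                changed = True
        if not changed:
            return state, sweep
    return state, max_sweeps

# Assumed example: one stored pattern, weights w_ij = p_i * p_j with zero diagonal.
pattern = [1, -1, 1, -1]
weights = [[0 if i == j else pattern[i] * pattern[j] for j in range(4)]
           for i in range(4)]

# Start from the pattern with its second element flipped.
final_state, sweeps = iterate_to_steady_state(weights, [1, 1, 1, -1])
```

In this example the corrupted input settles back to the stored pattern; the second sweep only confirms that no unit changes, which is the steady state condition referred to above.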

In addition to these elementary operations (and combinations thereof), it is necessary to
accommodate operationally for both the learning function (which includes input-dependent
interconnection weight updates) and for the computational function (which in general
requires iterative feedback and the implementation of nonlinear thresholding, usually with
fixed interconnection weights). The only additional fundamental operation required by
these features, that of input-dependent interconnection weight updates, can usually be
reduced to at most a combination or sequence of the previously described fundamental
operations of addition, subtraction, multiplication, and nonlinear thresholding. It is
important to note, however, that in this case the operations pertain directly to the
implementation of interconnection weights, rather than to functions performed by the neuron units;
hence, the physical processes involved may in fact be considerably different in nature, and
thus subject to a quite different set of constraints. This differentiation between the two
different “types” of basic operations (those pertaining to the neuron units and those
pertaining to the interconnection pathways and weights) is discussed in more detail below, as
well as in the section describing the fundamental physical and technological limitations of
neuro-optical computing.
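
As an example of how a weight update reduces to the elementary operations, the following Python sketch applies an error-driven (delta rule) update to a small weight matrix using only subtraction, multiplication, and addition. The delta rule is chosen here purely for illustration; the text does not single out a particular learning rule, and the names and the learning rate are hypothetical.

```python
def update_weights(weights, inputs, outputs, targets, rate=0.1):
    """Return new weights w_ij + rate * (target_i - output_i) * input_j."""
    return [
        [w + rate * (target - output) * x      # subtraction, multiplication, addition
         for w, x in zip(row, inputs)]
        for row, output, target in zip(weights, outputs, targets)
    ]

# One neuron unit with two inputs, initially unweighted.
old_weights = [[0.0, 0.0]]
new_weights = update_weights(old_weights, inputs=[1.0, -1.0],
                             outputs=[0.0], targets=[1.0], rate=0.5)
# new_weights is [[0.5, -0.5]]
```

Each entry of the weight matrix is updated independently of the others, so an update of this kind could in principle be carried out for all weights in parallel.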

At the outset, we must further differentiate between those fundamental photonic
operations that are intended for “optical” implementation, and those that are envisioned
for “optoelectronic” implementation. In the first category, we place those types of
physical processes in which one or more beams of light interact either directly (as in coherent
interference) or through an intermediate physical medium (as in the summation of two
incoherent beams of light on a single detector). In the second category, we place operations
that involve one or more photon-to-electron or electron-to-photon conversion processes
prior to the actual implementation of the desired function, which is then assumed to be
accomplished using intermediate (primarily analog, but perhaps digital) circuitry. An
example of this latter category might be the subtraction of two optical signals by independent
but simultaneous photodetection of each signal, followed by the use of an analog electronic
differential amplifier to execute the functional subtraction. Most proposed neuro-optical
processors rely to some degree on both types of physical implementation mechanisms (cf.
the sections “Architectural Considerations for Photonic Neural Network Implementations”
and “An Implementation Strategy”, below). In fact, the eventual degree of success
achievable by neuro-optical computing techniques rests heavily on an appropriate balance
of these mechanisms within a given system, optimized to yield the greatest computational
advantage within the allowable physical and system constraints.

Optoelectronic  implementations  of  neural  functions  for  the  most  part  rely  on  optical 
signals  as  inputs  and  outputs  to  and  from  the  neuron  units  in  order  to  allow  for  both 
high  bandwidth  and  high  interconnection  multiplexing  capacity,  and  on  electronic  signal 
combination  and  processing  locally  within  each  neuron  unit.  As  such,  optoelectronic  func¬ 
tions  are  characterized  primarily  by  the  optical  characteristics  of  interconnection,  and  by 
the  electronic  characteristics  of  computation.  The  former  will  be  described  below,  while 
the  latter  has  been  discussed  both  by  Bang  Lee  and  Bing  Sheu  elsewhere  in  this  volume 
[Lee, 1991] as well as by Carver Mead in a recent elegant monograph [Mead, 1989]. On
the  other  hand,  optical  implementations  of  these  functions  rely  on  communication  by,  as 
well  as  the  interaction  of,  two  or  more  optical  signals  in  order  to  accomplish  the  relevant 
computation,  which  is  often  (but  not  always)  followed  by  a  photon-to-electron  conversion 
process in some form of single channel or array detector. It is to the fundamental principles
of  such  optical  computational  interactions  that  we  next  turn  our  attention. 

In  order  to  adequately  consider  even  so  basic  a  process  as  optical  addition,  it  is  essential 
to  differentiate  between  two  basic  types  of  optical  interactions  (as  determined  by  the  nature 
of the optical signals involved): incoherent and coherent. Incoherent interactions occur
whenever the light wavefronts representing the input signals temporally dephase (do not
oscillate  in  unison)  over  the  relevant  time  of  observation  (detector  temporal  integration 
window),  in  that  they  are  either  both  temporally  incoherent  at  the  outset,  or  are  each 
temporally coherent but separated in optical frequency by more than the inverse of the
observation time. Interactions in which the input optical signals spatially dephase over
the  aperture  of  the  relevant  detector  wherever  the  output  is  utilized  (detector  spatial 
integration  window)  are  also  incoherent  for  all  practical  purposes,  and  will  obey  incoherent 
summation  rules  as  given  below.  Coherent  interactions  occur,  on  the  other  hand,  whenever 
the  light  wavefronts  representing  the  input  signals  simultaneously  maintain  a  constant 
phase  relationship  over  the  detector  spatial  and  temporal  integration  windows. 

From  these  remarks,  it  can  be  seen  that  it  is  quite  important  to  understand  the  dis¬ 
tinction  between  coherent  (or  incoherent)  light  and  coherent  (or  incoherent)  interactions  as 
defined  by  the  eventual  detector  configuration  and  operational  parameters.  For  example, 
it  is  perfectly  acceptable  to  consider  a  situation  in  which  two  mutually  coherent  optical 
beams  interact  to  produce  an  interference  pattern  with  a  spatial  scale  small  compared  with 
the  relevant  detector  aperture.  In  such  cases,  the  interaction  will  in  fact  follow  incoherent 
summation  rules,  as  the  detector  effectively  integrates  the  space-variant  interference  pat¬ 
tern  over  the  full  detector  aperture  to  produce  exactly  the  same  result  as  the  interaction 
of  two  mutually  incoherent  (temporally)  optical  beams. 

Given these preconditions, then, the actual rules for the basic operations are quite
straightforward. Consider first the case of addition of two incoherent optical signal beams
in a collinear geometry, in which the two distinct input beams with intensities $I_1$ and $I_2$
are assumed to emerge from a beam-combining optical system (as yet undetermined) such
that the two output beams are collinear. If the beams are combined, for example, by
a nondispersive (wavelength-insensitive) 50/50 (50% transmission, 50% reflection)
beamsplitter as shown in Figure 15.1(a), two possible output beams $I_{\mathrm{out}}$ and $I'_{\mathrm{out}}$ are created,
each with an intensity given by:

$$I_{\mathrm{out}} = I'_{\mathrm{out}} = \tfrac{1}{2}\,(I_1 + I_2). \qquad (1)$$

The output intensity is thus linearly proportional to the sum of the input intensities. Note
that this operation cannot be accomplished without an inherent loss, in the case shown
above equal to 0.5 or about 3 dB. In fact, if we wish to combine $N$ beams collinearly by
this technique (using a linear chain of nondispersive beamsplitters), $N - 1$ beamsplitters
are required with transmissivities given by $1/2, 2/3, 3/4, \ldots, (N-1)/N$, representing a
total loss of $(N-1)/N$ with an overall throughput of $1/N$:

$$I_{\mathrm{out}} = \frac{1}{N}\,(I_1 + I_2 + I_3 + \cdots + I_N). \qquad (2)$$

Instead of using a linear chain of beamsplitters with different transmissivities, we could
alternatively construct a binary tree structure by pairing the inputs, which again requires
$N - 1$ beamsplitters, but in this case with equal transmissivities of $1/2$. This system of
beamsplitters also exhibits an overall throughput of $1/N$ (for even values of $N$). Although
beam combination of a large number of inputs by the multiple beamsplitter method is
impractical, we will utilize this result a little later in order to understand the essential
features of holographic beam combiners, which are subject to many of the same constraints.
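
The throughput bookkeeping for both arrangements can be checked with a short sketch. The helper names `chain_weights` and `tree_weight` are illustrative assumptions, not part of the original analysis. The sketch assumes lossless beamsplitters, assumes that the k-th element of the linear chain transmits the running sum and reflects the newly added beam, and (for the tree) assumes that the number of inputs is a power of two so that every level is fully paired.

```python
from fractions import Fraction


def chain_weights(n):
    """Fraction of each input intensity that reaches the collinear output
    of a linear chain of n - 1 lossless beamsplitters (hypothetical helper)."""
    if n < 2:
        raise ValueError("need at least two beams")
    weights = [Fraction(1)]  # beam 1 enters the chain unattenuated
    for k in range(1, n):
        t = Fraction(k, k + 1)  # intensity transmissivity of beamsplitter k
        r = 1 - t               # intensity reflectivity (lossless assumption)
        # The running sum is transmitted; the new beam k + 1 is reflected.
        weights = [w * t for w in weights] + [r]
    return weights


def tree_weight(n):
    """Per-beam throughput of a binary tree of 50/50 beamsplitters,
    assuming n is a power of two (an assumption of this sketch)."""
    levels = n.bit_length() - 1
    if n < 2 or 1 << levels != n:
        raise ValueError("this sketch assumes n is a power of two")
    return Fraction(1, 2) ** levels


if __name__ == "__main__":
    print(chain_weights(4))  # every entry should equal 1/N
    print(tree_weight(8))
```

In both cases each input contributes with the same weight, so the output is proportional to the unweighted sum of the input intensities, as Equation (2) requires.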


FIGURE  15.1  Illustration  of  optical  addition  utilizing  a  50/50  beamsplitter; 
(a)  collinear  incoherent  beam  geometry;  (b)  collinear  coherent  beam  geometry, 
showing  input  and  output  amplitudes;  (c)  collinear  coherent  beam  geometry, 
showing  input  and  output  intensities. 


It  should  be  noted  in  passing  that  the  overall  throughput  loss  implied  by  Equation  (2) 
can  be  circumvented  if  the  beams  to  be  summed  incoherently  are  sufficiently  distinct  in 
wavelength  that  a  dichroic  mirror  can  be  used  to  combine  them.  A  dichroic  mirror  reflects 
light within a given wavelength range, and transmits light outside of that range. Multiple
dichroic mirrors can be used to collinearly sum multiple beams through appropriate
choice  of  the  input  wavelengths  in  each  arm,  and  of  the  characteristic  wavelengths  of  each 
succeeding  dichroic  mirror. 

Consider next the case of addition of two coherent optical signal beams in a collinear
geometry. An example, again using a beamsplitter, is shown schematically in Figure
15.1(b). The presence of the beamsplitter generates a $\pi/2$ phase shift for each transmitted
beam, and a $\pi$ phase shift for each reflected beam [Haus, 1984]. In this case, due to the
coherent nature of the two input signal beams, the output intensity is no longer given
simply by the sum of the input intensities. In fact, the output amplitude is proportional
to the sum of the input signal amplitudes (provided that the two beams have identical
polarizations):

$$a_{\mathrm{out}} = \frac{1}{\sqrt{2}}\left(i\,a_1 - a_2 e^{i\phi}\right), \qquad (3)$$


in which $\phi$ is the relative phase between the two wavefronts (here assumed constant over the
detector aperture). It should be noted parenthetically that optical beams with orthogonal
polarizations do not interfere, and hence follow the incoherent addition rule. We assume
throughout  this  chapter  that  all  beams  are  polarized  identically. 

From this simple equation, several important principles can be seen to emerge. First,
the representation we must choose for simple addition to occur with coherent light is
different from that in the case of incoherent light: in the coherent case, we must use the
amplitudes (containing phase information), whereas in the previous (incoherent) case addition
is linear in the intensities. Second, the input-output transformation represented by Equation
(3) reveals an easy method for implementing both addition and subtraction: we merely
set the phase difference to $-\pi/2$ for addition, and to $\pi/2$ for subtraction. This can be
accomplished either by adjusting the relative path lengths of the two input beams, or by
inserting an appropriately oriented wave plate in one of the two beams. In the incoherent
case treated above, no such algorithm exists since we are adding intensities (which are
positive definite quantities), and direct subtraction is not possible without intermediate
intervention by an active optical or optoelectronic device. Third, note that the second
output beam is now not symmetric with the first:

$$a'_{\mathrm{out}} = \frac{1}{\sqrt{2}}\left(-a_1 + i\,a_2 e^{i\phi}\right). \qquad (4)$$


This asymmetry in output amplitudes directly results from the asymmetry between the
phases of the reflected and transmitted components in a partially transmitting mirror
[Haus, 1984].

Since the intensity of an optical beam is related to its amplitude by the relation:

$$I_m = (\mathbf{a}_m^{*})^{T}\,\mathbf{a}_m, \qquad (5)$$

in which $\mathbf{a}_m$ is the vector amplitude of the wave representing its polarization, $*$ represents
the complex conjugation operation, and $T$ represents the transpose of the vector, the
output intensities in the two coherently summed channels are given by:

$$I_{\mathrm{out}} = \tfrac{1}{2}\left[a_1^2 + a_2^2 - 2 a_1 a_2 \sin\phi\right], \qquad (6)$$

and

$$I'_{\mathrm{out}} = \tfrac{1}{2}\left[a_1^2 + a_2^2 + 2 a_1 a_2 \sin\phi\right], \qquad (7)$$

as shown in Figure 15.1(c). From these two equations, it can be seen that for arbitrary
values of the phase shift $\phi$, the output intensity is not simply related to the sum of the input
intensities, but instead has a seemingly undesirable cross term. We can use this cross term
to advantage by noting that for phase shifts of 0 or any integer multiple of $\pi$, both output
intensities are equal, and reduce to the expression previously noted for the incoherent case
(Equation (1)). Thus for coherent illumination, we can either perform addition directly
with the amplitudes, or with the intensities if we are careful about proper phasing of the
input signals. One difficulty with the former approach is that most detectors are linear in
intensity but not in amplitude, as will be discussed in further detail below.
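
The two-beam beamsplitter relations can be checked numerically. The following is a minimal sketch, assuming scalar (identically polarized) real input amplitudes, an ideal lossless 50/50 beamsplitter, and the phase convention quoted above from [Haus, 1984] (a factor of i on transmission and of -1 on reflection). The function names, and the assignment of beam 1 as the transmitted component of the first output, are illustrative assumptions rather than part of the original analysis.

```python
import cmath
import math


def beamsplitter_outputs(a1, a2, phi):
    """Complex output amplitudes of an ideal 50/50 beamsplitter.

    Assumed convention: exp(i*pi/2) = i on transmission, exp(i*pi) = -1 on
    reflection; phi is the phase of beam 2 relative to beam 1.
    """
    s = 1 / math.sqrt(2)
    b2 = a2 * cmath.exp(1j * phi)
    out = s * (1j * a1 - b2)         # beam 1 transmitted, beam 2 reflected
    out_prime = s * (-a1 + 1j * b2)  # beam 1 reflected, beam 2 transmitted
    return out, out_prime


def intensity(a):
    """Intensity as the squared modulus of a scalar complex amplitude."""
    return abs(a) ** 2


if __name__ == "__main__":
    for phi in (-math.pi / 2, 0.0, math.pi / 2):
        out, out_prime = beamsplitter_outputs(3.0, 1.0, phi)
        print(round(phi, 3), round(intensity(out), 6), round(intensity(out_prime), 6))
```

With a1 = 3 and a2 = 1, a relative phase of -π/2 places (a1 + a2)²/2 in the first output and (a1 - a2)²/2 in the second, a relative phase of +π/2 exchanges the two, and a relative phase of 0 splits the total intensity equally. In every case the two output intensities sum to a1² + a2², as expected for a lossless element.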

In  direct  analogy  with  the  analysis  presented  for  the  case  of  incoherent  multiple  signal 
beam  summation,  we  can  extend  the  above  equations  to  include  the  case  of  multiple 
collinear  coherent  inputs  using  appropriate  combinations  of  beamsplitters.  For  N  coherent 
input beams summed optimally, the output amplitude is given by:

$$a_{\mathrm{out}} = \frac{1}{\sqrt{N}}\left[a_1 + a_2 + a_3 + \cdots + a_N\right]. \qquad (8)$$

In order to achieve this result, one must again use $N - 1$ beamsplitters with (intensity)
transmissivities identical to those employed in the incoherent case, and in addition the
relative phases must be arranged such that the phase difference between $a_2$ and $a_1$ is
$-\pi/2$, and each successive beam is increased in phase by $\pi/2$.


In  all  of  the  cases  described  above,  we  have  constrained  the  problem  by  requiring  that 
the  output  beams  all  be  collinear,  and  in  fact  many  proposed  neuro-optical  architectures 
implicitly  demand  such  a  constraint.  We  will  show  in  later  sections,  however,  that  this 
is perhaps an unnecessary and in many cases undesirable constraint. Hence we consider
here also the case of optical addition with noncollinear output beams, requiring instead
only that the summed beams fill the same detector aperture. There are in fact a number
of interesting variants of these two constraints, but we will limit the discussion to the two
principal cases only. One possible such configuration is shown in Figure 15.2(a), in which
two incoherent signal beams are summed within the detector aperture by using two 100%
reflecting mirrors, producing an output intensity given by:

$$I_{\mathrm{out}} = I_1 + I_2. \qquad (9)$$

This  output  intensity  is  uniform  across  the  detector  aperture,  as  shown  schematically  in 
the figure. In addition, relaxation of the requirement for collinearity can be seen to now
allow for the use of mirrors instead of beamsplitters, eliminating the loss we found in the
previous  case  at  the  expense  of  increased  angular  multiplexing. 


FIGURE  15.2  Illustration  of  optical  addition  utilizing  mirrors:  (a)  angular¬ 
ly  multiplexed  incoherent  beam  geometry;  (b)  angularly  multiplexed  coherent 
beam  geometry. 


Up  to  this  point  in  the  discussion,  we  have  had  to  consider  the  phase  of  the  optical 
wavefronts  only  for  the  case  of  collinear,  coherent  addition.  In  that  case,  we  only  needed 
to  use  the  relative  phase  shift  between  the  two  beams  to  derive  Equations  (6)  and  (7), 
since the phase shift is constant in both space and time over the detector spatial and
temporal integration windows. In order to consider the case of noncollinear, coherent
addition, however, we must allow for the space-variant phase shifts that naturally result
when two coherent wavefronts cross at a non-zero angle. These effects are automatically
taken into account if we express each wave (beam) amplitude in a form that incorporates
both its magnitude and its phase everywhere in space at a given instant of time. For a
plane wave (in which the planes of constant phase are oriented normal to the direction
of propagation), it proves convenient to use the form $\mathbf{a}\exp(i\mathbf{k}\cdot\mathbf{r})$, in which $\mathbf{a}$ is a vector
representing the polarization of the optical wave (its amplitude in each of the principal
coordinate directions), $\mathbf{k}$ is the wave vector of the optical wave (defined as a vector with
direction normal to the planes of constant phase, and with magnitude $|\mathbf{k}| = 2\pi n/\lambda$, in
which $n$ is the refractive index of the propagation medium and $\lambda$ is the wavelength of the
light wave in vacuum), and $\mathbf{r}$ is a position vector defined from an arbitrary origin in space
($\mathbf{r} = x\hat{\mathbf{x}} + y\hat{\mathbf{y}} + z\hat{\mathbf{z}}$, in which $\hat{\mathbf{x}}$, $\hat{\mathbf{y}}$, and $\hat{\mathbf{z}}$ are unit vectors along the Cartesian coordinate
axes).

For  the  case  of  coherent  illumination,  then,  the  result  of  noncollinear  summation  is  as 
shown  schematically  in  Figure  15.2(b).  In  this  case,  the  phase  of  each  input  wave  varies 
across the detector aperture (assumed to lie in the plane $z = 0$) at a rate that is a function
of the angular separation of the incident beams, as well as of the angular deviation of the
bisector from normal incidence. This condition can be represented by writing the wave
amplitudes with a space-variant phase, which is in turn dependent on the $x$-component
of the wave vector $k_x$ in the form $a_1 \exp(i k_x x)$. The two waves will thus interfere in the
plane of the detector, forming essentially a new wave with a local amplitude given by the
sum of the incident amplitudes. The resulting intensity pattern has both a space-invariant
(uniform) and a space-variant (sinusoidal) component:

$$I_{\mathrm{out}} = a_1^2 + a_2^2 + 2 a_1 a_2 \cos(2 k_x x). \qquad (10)$$

If  we  assume  that  the  detector  is  linear  in  intensity  over  the  dynamic  range  represented 
by  this  equation,  and  furthermore  that  the  detector  is  uniform  in  responsivity  over  its 
aperture, then the output from the detector will be the spatial average of this interference
pattern, resulting in an output intensity that is in fact a sum of the input intensities, as
represented by Equation (9). In this case, the result is independent of the relative phase
shift (difference) between the two beams at their points of entry into the beam combiner
system,  since  such  a  phase  shift  will  merely  result  in  a  translation  of  the  interference 
pattern  without  altering  its  integrated  value.  This  result  also  can  be  extended  to  the  case 
of  multiple  input  signal  beams,  with  the  stipulation  that  mirrors  cannot  be  allowed  to 
occlude  each  other;  hence,  for  a  given  beam  width,  only  a  certain  number  of  beams  can 
be  combined  without  loss  by  means  of  this  method  without  overcrowding  the  available 
angular  spectrum. 
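
The averaging argument can be illustrated numerically. The sketch below is a hedged example rather than part of the original analysis: it integrates a fringe pattern of the form of Equation (10) over a uniform one-dimensional detector by the midpoint rule. The function name, the added `phase` offset (used to mimic a relative phase shift at the beam combiner input), and the sample count are all illustrative assumptions.

```python
import math


def detected_intensity(a1, a2, kx, aperture, phase=0.0, samples=20000):
    """Spatial average of a1^2 + a2^2 + 2*a1*a2*cos(2*kx*x + phase) over a
    uniform detector spanning -aperture/2 .. +aperture/2 (midpoint rule)."""
    dx = aperture / samples
    total = 0.0
    for j in range(samples):
        x = -aperture / 2 + (j + 0.5) * dx  # centre of the j-th sample cell
        total += a1**2 + a2**2 + 2 * a1 * a2 * math.cos(2 * kx * x + phase)
    return total * dx / aperture


if __name__ == "__main__":
    kx = math.pi  # fringe period pi / kx = 1 in arbitrary length units
    # Aperture spanning many fringes: the cross term averages away.
    print(detected_intensity(2.0, 1.0, kx, aperture=100.0))
    print(detected_intensity(2.0, 1.0, kx, aperture=100.0, phase=1.0))
    # Aperture much smaller than one fringe: the cross term survives.
    print(detected_intensity(2.0, 1.0, kx, aperture=0.01))
```

When the aperture spans many fringe periods, the detected value approaches a1² + a2² regardless of the phase offset; when the aperture is much smaller than a fringe period, the detector instead resolves the local intensity, including the cross term.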

The  operation  of  optical  multiplication  is  fundamentally  different  in  a  number  of  ways 
from  those  of  addition  and  subtraction.  Perhaps  the  most  important  difference  is  that  the 
multiplication  of  either  beam  amplitudes  or  intensities  cannot  be  accomplished  directly, 
but  must  instead  utilize  a  nonlinear  medium  of  some  form  within  which  the  beams  can 
interact.  There  are  two  principal  types  of  interactions  to  consider;  those  in  which  the  two 
beams  must  be  present  simultaneously  in  order  to  form  the  desired  product,  and  those 
in  which  the  two  beams  are  utilized  sequentially  in  time.  In  general,  the  former  interac¬ 
tions  tend  to  operate  on  the  amplitudes  and  hence  require  mutual  coherency,  whereas  the 
latter  interactions  typically  form  products  of  (incoherent  or  coherent)  intensities,  which 
are  therefore  more  straightforward  to  detect  with  currently  available  intensity-sensitive 
detectors. 

Simultaneous  multiplication  of  two  optical  beams  is  suggested  by  Figure  15.2(b),  in 
which two coherent signal beams are angularly multiplexed to form the interference pattern
given by Equation (10). Note that the space-variant part of the output intensity in the
plane of the “detector” is proportional to the product of the amplitudes, i.e., is of the
form $2 a_1 a_2 \cos(2 k_x x)$. If instead of employing a uniform (spatially averaging) detector as
before,  we  were  now  to  employ  a  space-variant  detector  sensitive  to  the  local  intensity, 
it  is  possible  to  record  this  modulation  term  along  with  the  unmodulated  (uniform)  bias 
represented  by  the  squares  of  the  two  amplitudes.  If  in  addition  the  “detector”,  for  example, 
is  assumed  to  generate  a  change  in  either  its  absorption  coefficient  or  its  refractive  index 
as a function of the recorded intensity pattern (for a given exposure), a diffraction grating
will be formed. The resulting diffraction grating can be characterized by an amplitude
that  is  proportional  to  the  product  of  the  input  signal  beam  amplitudes,  and  that  can 
be  probed  by  a  third  so-called  “readout”  beam.  This  is  at  once  the  basic  principle  of 
holographic  recording  (as  explained  in  more  detail  below  in  the  subsection  on  “Photonic 
Interconnections”),  and  at  the  same  time  allows  the  implementation  of  the  multiplicative 
operation  for  coherent  inputs.  It  should  be  noted  that  although  this  process  produces  a 
useful  result  for  the  case  of  two  inputs,  extension  to  larger  numbers  of  inputs  is  not  trivial, 
and  requires  the  utilization  of  higher  order  terms  in  the  susceptibility  tensor  (representing 
the  complex  dielectric  constant)  for  implementation.  The  one  exception  to  this  rule  is 
the use of the probe (readout) beam intensity $I_p$ as a third effective input, in which case
output intensities proportional to either $I_p a_1 a_2$ or $I_p I_1 I_2$ can be detected depending on the
operational  parameters  of  the  recording  medium  and  readout  configuration. 

If  the  simultaneity  requirement  is  relaxed  to  allow  for  sequential  interactions  in  an 
intervening  photosensitive  medium,  then  it  is  possible  to  multiply  two  incoherent  input 
signals  by  means  of  the  simple  generic  scheme  shown  in  Figure  15.3.  The  medium  again 
acts  as  an  effective  detector  for  beam  1,  generating  a  transmittance  (in  its  range  of  linear¬ 
ity)  proportional  to  the  intensity  of  beam  1.  This  transmittance  can  be  generated  either 
directly,  or  through  the  exposure  given  by  the  product  of  the  intensity  and  the  exposure 
time  as  in  the  familiar  case  of  photographic  film.  Beam  2  is  effectively  employed  as  a  probe 
beam,  such  that  the  output  intensity  is  given  by: 

$$I_{\mathrm{out}} = c\,I_1 I_2, \qquad (11)$$

as desired, in which $c$ is a proportionality constant subject to the constraint that:

$$c\,I_1 < 1. \qquad (12)$$

This  process,  as  in  the  previous  case,  is  extendable  to  accommodate  an  arbitrary  number 
of  inputs  by  iteration,  unfortunately  resulting  both  in  a  lengthy  generation  sequence  for  a 
large  number  of  inputs,  and  in  the  potential  for  significant  nonlinear  effects  with  a  heavily 
constrained overall dynamic range. For the case of $N$ input beams, we can utilize $N - 1$
exposure steps in combination with $N - 1$ intermediate readout steps and a final readout
step with beam $N$ to generate an output intensity of the form:

$$I_{\mathrm{out}} = c^{N-1} I_1 I_2 I_3 \cdots I_{N-1} I_N, \quad \text{with} \quad c^{N-1} I_1 I_2 I_3 \cdots I_{N-1} < 1. \qquad (13)$$

In direct analogy to the case of summation, we could instead utilize a binary tree structure,
which requires only $\log_2 N$ time steps but uses the same number of devices.
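
The iterated (linear sequence) form of this multiplication can be sketched as follows. The example is an idealized model under stated assumptions, not a description of any particular device: each stage is taken to be a medium whose intensity transmittance is exactly c times the intensity recorded on it, with the same constant c at every stage, and the function name `sequential_product` is hypothetical.

```python
def sequential_product(intensities, c):
    """Iterated incoherent multiplication through a photosensitive medium.

    Each stage records the current intensity as a transmittance c * I and is
    then read out by the next beam. Raises ValueError if any stage would need
    a transmittance outside [0, 1), i.e. outside the assumed linear range.
    """
    if len(intensities) < 2:
        raise ValueError("need at least two beams")
    recorded = intensities[0]
    for probe in intensities[1:]:
        transmittance = c * recorded
        if not 0.0 <= transmittance < 1.0:
            raise ValueError("transmittance outside the physical range [0, 1)")
        recorded = transmittance * probe  # output intensity of this stage
    return recorded


if __name__ == "__main__":
    print(sequential_product([0.5, 0.4, 0.2], c=1.0))
```

For N inputs the loop runs N - 1 times, so the result carries a factor of c raised to the power N - 1, and the check applied at the last stage is the constraint stated alongside Equation (13).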

Finally,  it  should  be  noted  that  this  latter  process  of  incoherent  beam  multiplication 
through  an  intervening  medium  by  sequential  illumination  is  suggestive  of  the  process 
of  spatial  light  modulation,  in  which  the  same  basic  concept  is  extended  to  cover  a  two- 
dimensional  array  of  multiplication  elements.  In  fact,  this  process  is  an  essential  component 
of  the  general  area  of  photonic  switching,  to  which  we  will  turn  our  attention  below. 


FIGURE  15.3  Illustration  of  optical  multiplication  utilizing  a  medium  with 
variable  transparency. 


Before  turning  to  the  topics  of  photonic  switching  and  photonic  interconnections,  we 
conclude  this  section  with  a  discussion  of  the  fourth  principal  computational  process  that 
can be performed optically (as opposed to optoelectronically, as discussed below): that
of the incorporation of functional nonlinearity. Although many types of functional
nonlinearities are of interest in a generalized analog computational system, those of primary
utility in the neural network environment are for the most part threshold-like in nature. A
threshold function $f_T(x)$ of some input variable $x$ (such as the input intensity, for example)
can be described in general by:

$$f_T(x) = \begin{cases} T_{\min}, & -\infty < x < x_1 \\ m(x), & x_1 \le x \le x_2 \\ T_{\max}, & x_2 < x < \infty \end{cases}$$

in which the function $m(x)$ is a monotonic function with a minimum value of $T_{\min}$ and a
maximum value of $T_{\max}$. For a step function response, the function $m(x)$ can be eliminated
by setting $x_1 = x_2$. In many cases, a smoother transition between the two extreme states
has been found to generate enhanced network stability and faster settling times. In such
cases, the function $m(x)$ may be taken, for example, as a sigmoid with an exponential
onset and an asymptotic approach to the saturation level.
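
A threshold function of this general form can be sketched in a few lines. The particular transition used here, a logistic curve rescaled so that it meets the two plateau values exactly at the interval end points, is only one assumed choice of m(x); the function name and the `steepness` parameter are likewise illustrative and do not come from the original text.

```python
import math


def threshold(x, x1, x2, t_min=0.0, t_max=1.0, steepness=10.0):
    """Threshold-like response: t_min below x1, t_max above x2, and a
    monotonic sigmoid-shaped transition m(x) on the interval [x1, x2]."""
    if x2 < x1:
        raise ValueError("x2 must not be smaller than x1")
    if x < x1:
        return t_min
    if x > x2 or x1 == x2:
        return t_max  # includes the step-function case x1 == x2

    def logistic(u):
        return 1.0 / (1.0 + math.exp(-steepness * (u - 0.5)))

    # Rescale so that m(x1) = t_min and m(x2) = t_max exactly.
    u = (x - x1) / (x2 - x1)
    lo, hi = logistic(0.0), logistic(1.0)
    return t_min + (t_max - t_min) * (logistic(u) - lo) / (hi - lo)


if __name__ == "__main__":
    for x in (-0.5, 0.0, 0.25, 0.5, 0.75, 1.0, 1.5):
        print(x, round(threshold(x, 0.0, 1.0), 4))
```

Setting `x1` equal to `x2` recovers the step response, while larger values of `steepness` sharpen the transition without moving the saturation levels.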

The  incorporation  of  such  nonlinear  functionality  by  direct  optical  means  can  be  achieved 
through  the  use  of  a  number  of  different  types  of  nonlinear  materials;  such  materials  typ¬ 
ically  exhibit  a  change  in  their  refractive  index  or  absorption  coefficient  proportional  to 
the  first  and  higher  order  powers  of  the  local  optical  intensity.  One  example  of  such  a 
material  is  photographic  film,  which  after  development  exhibits  a  (negative)  sigmoid-like 
exposure  characteristic,  with  a  saturation  value  determined  by  the  maximum  optical  den¬ 
sity  achievable  within  a  given  film  thickness.  (The  optical  density  (OD)  of  a  medium  is 
given by the negative of the decadic logarithm of its transmittance; for example, a film that
transmits  1%  of  the  incident  illumination  has  an  optical  density  of  two  (OD2)).  Another 
common  example  of  an  optical  nonlinearity  is  the  photoconductive  saturation  behavior  of 
certain  semiconductor  materials  such  as  cadmium  sulfide,  zinc  selenide,  and  silicon.  In 
this  latter  case,  the  distinction  between  an  “all-optical”  nonlinearity  and  an  optoelectronic 
nonlinearity  becomes  somewhat  blurred,  as  the  photoconductor  can  be  thought  of  as  a 
light-sensitive  electronic  device. 
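
The optical density relation quoted above is easily expressed in code. This is a minimal sketch; the function names are illustrative assumptions.

```python
import math


def optical_density(transmittance):
    """Optical density: negative decadic logarithm of the transmittance."""
    if not 0.0 < transmittance <= 1.0:
        raise ValueError("transmittance must lie in (0, 1]")
    return -math.log10(transmittance)


def transmittance_from_od(od):
    """Inverse relation: fraction of the incident light that is transmitted."""
    return 10.0 ** (-od)


if __name__ == "__main__":
    print(optical_density(0.01))       # film transmitting 1% of the light
    print(transmittance_from_od(3.0))  # fraction transmitted at OD3
```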

Such  optical  techniques  for  the  generation  of  functional  nonlinearities  at  present  suffer 
several inherent disadvantages, in that they often require either an off-line post-exposure
development step (which is unsuitable for real time operation at high frame rates), long
response times, or very high optical intensities to achieve saturation. In addition, such
materials as yet have not proven to be readily programmable, which is often a desirable
feature from the systems perspective in order to accommodate variable threshold functions,
gain, saturation values, and offsets. As we will see in the next section, the incorporation
of electronic circuitry with optical detectors and modulators to achieve optoelectronic
nonlinearities can in fact greatly increase the threshold sensitivity and operational bandwidth
of nonlinear switching elements, while simultaneously providing flexible programming
capability.

Photonic Switching


The  switching  function,  that  of  providing  an  output  that  is  (perhaps  nonlinearly)  de¬ 
pendent  on  one  or  more  inputs,  is  a  principal  distinguishing  characteristic  of  neuron  units. 
Electronic circuit elements (particularly as configured by very large scale integration
techniques) are quite well suited to the switching task, as long as the number of inputs
(representing the fan-in) and the number of outputs (representing the fan-out) are both kept
relatively  small  (less  than  a  few  hundred  or  so  for  the  case  of  analog  fan-in  and  fan-out). 
However,  for  neural  network  implementations  that  demand  a  high  degree  of  connectivity 
(with  a  concomitantly  large  number  of  neuron  units),  the  required  gate  count  as  well  as 
the  area  required  for  interconnection  routing  in  purely  electronic  implementations  rapidly 
gets  out  of  hand. 

The  fundamental  aspects  of  the  fan-in  and  fan-out  components  of  the  switching  function 
are  quite  distinct,  and  lead  to  different  types  of  demands  on  the  chosen  implementation 
technology.  The  fan-in  of  a  number  of  inputs  requires  that  a  particular  functional  relation¬ 
ship  be  established  between  the  generated  output,  on  the  one  hand,  and  the  set  of  inputs, 
on the other. In the case of a neural network, the output typically depends on both sums
and differences of various combinations of the inputs. Therefore, a given implementation
technology must properly generate the requisite logical or functional relationship, as well
as provide for an appropriate physical input mechanism (e.g., the input leads in the case
of an electronic implementation). For electronic circuits, the network area required for the
provision of input leads and functional circuitry typically scales directly with the number of
inputs, which is an unfortunate dependence when the number of inputs is large. Fan-out,
on the other hand, usually implies the broadcast of a single output value to a number of
(input) locations or nodes. In electronics, the output power required to drive the inputs to
a large number of nodes scales directly with the number of these nodes, which again does
not scale favorably (but turns out to be an unavoidable penalty in any case). Significant
($N$-fold) fan-out often involves the incorporation of high power driver circuitry, which may
have to be duplicated $M$ times ($M < N$) in order to avoid unacceptable loading of the
output stages.

The  combination  of  both  fan-in  and  fan-out  components  of  the  switching  function 
reveals  a  further  demand  on  the  real  estate  required  for  the  establishment  of  weighted 
interconnections.  In  a  fully  connected  neural  network  with  N  neurons,  for  example,  area 
must be provided for the incorporation (storage and programming) of $N^2$ independent
weights as well as $N^2$ independent signal pathways. Hence, the chip area required in a
VLSI circuit implementation of such a fully connected neural network will scale at least as
the square of the number of neuron units ($O(N^2)$). Network segmentation into a number of
interconnected  chips  can  help  somewhat  to  expand  the  network  size  beyond  the  limitation 
imposed  by  applying  this  constraint  to  a  single  chip.  However,  the  limiting  factor  in  the 
multiple-chip  case  rapidly  becomes  interchip  communication  (I/O),  as  pinouts  from  VLSI 
chips  of  greater  than  two  hundred  or  so  are  not  technologically  feasible  at  present. 

Photonic implementations of neural networks take advantage of the simple beam-combining
mechanisms outlined above to multiplex inputs and outputs, and as such exhibit
much  higher  capacity  for  fan-in  and  fan-out  than  do  typical  electronic  implementations. 
The  utilization  of  optical  rather  than  electronic  interconnections  for  the  fan-in  and  fan-out 
functions  provides  for  completely  different  scaling  laws  at  large  numbers  of  inputs  and 
outputs  to  a  given  neuron  unit,  as  described  in  more  detail  in  the  following  subsection 
(“Photonic  Interconnections”). 

Even given photonic interconnections with a high degree of fan-in and fan-out capability,
the nonlinear functional (switching) relationship between the output and combinations
of  the  inputs  must  still  be  provided  for.  For  purposes  of  neural  network  implementation, 
the  primary  photonic  switching  component  is  the  spatial  light  modulator,  a  device  that 
alters  either  the  amplitude  or  phase  across  an  expanded  probe  beam  in  response  to  the 
local  intensity  (or  exposure)  across  an  input  (writing  or  recording)  beam. 

The  simplest  example  of  a  spatial  light  modulator,  albeit  one  that  cannot  operate  at 
real  time  frame  rates,  is  photographic  film.  Following  exposure  to  an  information-bearing 
optical field, in which an image of a given scene is brought into focus on the two-dimensional
plane of the film, a “latent” (undeveloped) image is formed within the photographic emulsion
on the surface of the film. Chemical development is used to transform the latent
image into a measurable change in the optical transparency (transmissivity) of the film,
which can then be “read out” or probed by secondary illumination to reveal features of
the recorded scene. In this context, slide projection is in fact the equivalent of amplified
readout with a probe beam, in the sense that the reconstruction of the image is accomplished
at a much higher level of intensity (for a longer period of time) than the original
exposure. 

As can be seen from this example, the basic functions performed by a spatial light modulator
are those of detection, functional transformation, and optical modulation, as shown
schematically in Figure 15.4(a). In the case of photographic film, the detection process occurs
at photosensitive centers during exposure, the functional transformation (the transfer
function that relates the output transparency to the input exposure) is incorporated during
development and fixing, and the optical modulation process is accessed during readout.
This division of the spatial light modulation function into three key elements is particularly
useful in the discussion of optoelectronic spatial light modulators, which typically
consist of separately identifiable detectors, control circuitry, and modulation elements, as
shown schematically in Figure 15.4(b-d). This functional division also allows extensive use
to be made of sophisticated electronic circuitry deployed locally within each pixel, both
to generate programmable nonlinear control functions and to compensate to a certain degree
for the nonidealities inherent in the optical detection and optical modulation elements.
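
The three-stage decomposition described above can be sketched in a few lines of Python. This is a minimal illustration rather than a model of any particular device: the function names, the linear detector response, and the logistic transfer function (with its gain and threshold values) are all assumptions introduced here.

```python
import math

def detect(intensity, responsivity=1.0):
    """Detection: convert local write-beam intensity to a signal (assumed linear)."""
    return responsivity * intensity

def transform(signal, gain=4.0, threshold=0.5):
    """Functional transformation: an assumed logistic (sigmoid-like) transfer function."""
    return 1.0 / (1.0 + math.exp(-gain * (signal - threshold)))

def modulate(probe_intensity, transmittance):
    """Optical modulation: scale the probe (readout) beam by the pixel transmittance."""
    return probe_intensity * transmittance

def slm_readout(write_image, probe_intensity=1.0):
    """Apply the three stages independently (pointwise) to every pixel."""
    return [[modulate(probe_intensity, transform(detect(w))) for w in row]
            for row in write_image]

# Hypothetical 2 x 3 write image with normalized intensities.
write_image = [[0.0, 0.5, 1.0],
               [1.0, 0.5, 0.0]]
readout = slm_readout(write_image)
```

Keeping the stages as separate functions mirrors the point made in the text: the transfer function can be swapped (for a threshold or a logarithm, say) without touching the detection or modulation steps.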


FIGURE 15.4 Fundamental principles of spatial light modulator function:
(a) block diagram of the principal functions of an optically-addressed spatial
light modulator, including the detection, functional implementation, and modulation
functions; (b) schematic diagram of an $N \times N$ array of spatial light
modulator pixels, in which three pixels are shown in different transmission states;
(c) expanded view of the pixel array, showing an incomplete fill factor
within each pixel; (d) expanded view of a single pixel within the array, illustrating
one possible pixel configuration that incorporates two detector elements $D_1$
and $D_2$, control electronics for impedance matching and functional implementation,
and two modulator elements, shown here in different transmittance states.


Up  to  this  point  in  the  discussion,  we  have  focused  on  optically-addressed  spatial  light 
modulators  (OASLMs)  that  respond  locally  to  the  incident  light  intensity,  as  this  light 
detection  function  is  common  to  most  envisioned  photonic  neural  network  architectures. 
Another way of controlling the modulation within an array on a pixel-by-pixel basis is to
configure the spatial light modulator such that it can accept a serial or parallel electronic
input signal, which can be decoded (or demultiplexed) to drive each individual modulator
element. Such electrically-addressed spatial light modulators (EASLMs) can be driven, for
example, by the output of a television camera to again combine the functions of detection,
functional transformation (which may be accomplished in an external circuit), and modulation.
One advantage of such a combination is the current advanced state of the art in
closed circuit television cameras (CCTVs), which exhibit exceedingly high performance at
relatively low cost. One notable disadvantage, however, is the implied limitation on the
frame rate of the combined device, since most high resolution TV cameras are designed to
operate  at  less  than  one  hundred  frames  per  second. 

Over  the  past  two  decades,  a  wide  range  of  physical  modulation  mechanisms  have  been 
investigated  for  use  in  various  types  of  spatial  light  modulators.  Such  mechanisms  include 
the  modulation  of  the  index  of  refraction  or  birefringence  in  single  crystal  materials  by 
means of an applied electric field (the electrooptic effect), the reorientation of liquid crystal
molecules (producing in turn a change in the index of refraction or birefringence) by
either  an  applied  electric  field  or  by  local  optically-induced  heating,  changes  in  coloration 
produced  by  optical  absorption  (the  photochromic  effect),  modulation  of  the  polarization 
of  reflected  light  by  application  of  local  magnetic  fields  (the  magnetooptic  effect),  surface 
deformations  in  a  thin  film  or  membrane  induced  by  either  applied  electric  fields  or  local 
optically-induced heating, changes in the local refractive index induced by the application
of pressure or by the transmission of an acoustic wave (the acoustooptic effect), and
electric field modulation of the absorption or dispersion properties of semiconductor device
structures. The utilization of these physical modulation mechanisms in various spatial light
modulator configurations has been addressed in a number of review articles [Tanguay, 1985;
Warde, 1987], journal special issues [e.g., Spatial Light Modulators for Optical Information
Processing, 1989], and topical conference proceedings [e.g., Spatial Light Modulators and
Applications,  1990]. 

The principal configurational and operational characteristics of spatial light modulators
that  are  of  interest  for  application  to  neural  networks  include  optical  sensitivity,  write 
(input)  wavelength,  read  (output)  wavelength,  input-output  transfer  function,  functional 
programmability,  operational  bandwidth,  degree  of  integration,  pixel  size,  total  number  of 
pixels  per  chip,  output  modulation  contrast  ratio,  dynamic  range,  and  dissipated  power 
density.  In  many  cases,  these  characteristics  are  interdependent,  and  thus  impose  at  times 
contradictory  design  constraints  that  must  be  optimized  in  the  overall  systems  context. 
The  fundamental  and  technological  limitations  that  affect  device  design  and  performance 
are  discussed  further  below  and  in  a  succeeding  section. 

As  is  the  case  for  electronic  circuitry,  both  monolithic  and  hybrid  approaches  to  the 
development  of  optoelectronic  spatial  light  modulators  with  suitable  functionality  have 
been employed. In the monolithic approach, the detectors, control circuitry, and modulation
elements within each individual picture element (pixel) are integrated within a
single class of materials on a supporting substrate, as shown schematically in Figure 15.5.
An example of such an approach is the integration of p-n or p-i-n junction photodiodes
with metal-semiconductor field effect transistors (MESFETs) [Sze, 1981a] to drive multiple
quantum well (MQW) optical modulators based on the quantum confined Stark effect
(QCSE) [Miller, 1990], all fabricated by means of photolithographic processing with multiple
mask levels on gallium arsenide (GaAs) substrates. In Figure 15.5, two distinct
approaches to the monolithic integration of spatial light modulators are illustrated, differentiated
primarily by the method employed to physically or electrically isolate (pixelate)
the  modulator  elements. 

Two  particularly  critical  parameters  of  spatial  light  modulators  used  in  neural  network 
implementations are the contrast ratio and dynamic range of the modulator. Their values
can in certain cases be increased by incorporating the active modulation layer (for example,
a multiple quantum well device) within a symmetric or asymmetric optical (Fabry-Perot)
cavity [Whitehead, 1989a; Whitehead, 1989b; Whitehead, 1989c; Yan, 1989]. The asymmetric
case is shown schematically in Figure 15.5(a), in which two multilayer Bragg mirrors
are used to form a reflective cavity with a high reflectivity ($R$) on the substrate side, and a
lower reflectivity on the air-incident side. One of several advantages of monolithic integration
is the potential for utilizing common components for multiple purposes. For example,
the basic MQW modulator structure can also be used as a p-i-n photodetector by application
of appropriate bias voltages, as shown in Figure 15.5(b). To date, significant progress
in such monolithically integrated optical modulators has been achieved, although spatial
light  modulators  with  large  numbers  (>  lO**)  of  pixels  have  not  yet  been  fabricated  that 
exhibit  the  relatively  high  degree  of  integration  described  above. 


FIGURE 15.5 Examples of monolithically-integrated spatial light modulators.
The chosen examples incorporate photodetectors, control circuitry, and
multiple quantum well modulators within each pixel on a single gallium arsenide
(GaAs) substrate. In (a), the control electronics and photodetector
elements are fabricated following the photolithographic definition and physical
isolation  of  the  modulator  elements,  while  in  (b)  a  buffer  (isolation)  layer  is 
used  to  allow  fabrication  and  interconnection  of  all  of  the  elements  without 
chemical  or  ion  beam  etching. 


In the hybrid approach, on the other hand, certain of the device functions may be integrated
on a substrate within one materials system (with its associated process technology)
in order to optimize either their performance characteristics or manufacturability, while
others are integrated on a separate substrate within a different materials system (with a
necessarily distinct process technology). Following separate processing sequences for each
individual component, the two substrates are then interconnected (bonded together) such
that the mating pixels on each substrate are in pairwise electrical contact. For example,
several currently investigated types of spatial light modulators (SLMs) incorporate the detection
elements and control circuitry on a silicon (Si) substrate utilizing standard VLSI
design rules, while the modulation elements are based in a separate technology (such as
multiple quantum well structures integrated on a GaAs substrate). Alternatively, hybrid
spatial light modulators can be fabricated on a single common substrate, with additional
functionality provided by the growth, deposition, or coating of a second active material
onto the substrate. Examples of this type of hybrid SLM include silicon VLSI/ferroelectric
liquid crystal devices [Drabik, 1990] and silicon/PLZT devices [Lin, 1990]. Such hybrid
SLMs are also in the early stages of advanced development, and are the subject of current
intensive research and development efforts [Spatial Light Modulators for Optical Information
Processing, 1989; Spatial Light Modulators and Applications, 1990].

Using  either  of  these  two  approaches  to  spatial  light  modulator  fabrication,  devices 
based  on  both  transmissive  and  reflective  readout  can  be  constructed,  with  different  impli¬ 
cations  on  the  overall  systems  design  in  each  case.  In  particular,  the  reflective  mode  can 
be  used  to  advantage  in  configuring  a  hybrid-integrated  SLM  to  mate  the  detection  and 
control  circuitry  functions  of  the  device  with  the  optical  modulation  function.  Use  of  the 
reflective  readout  configuration  allows  the  detection  and  control  circuitry  to  be  integrated 
on a substrate that is opaque to the readout illumination wavelength [Kyriakakis, 1990],
as shown schematically in Figure 15.6.


FIGURE  15.6  Example  of  a  hybrid  spatial  light  modulator,  in  which  the 
photodetectors  and  control  electronics  are  fabricated  on  a  silicon  substrate, 
and  the  multiple  quantum  well  modulator  elements  are  fabricated  on  a  gallium 
arsenide (GaAs) substrate. The two sets of devices are bump contacted on a
pixel-by-pixel  basis  to  provide  parallel  electrical  continuity. 


As an example of the degree of functional integration currently envisioned for spatial
light modulators that are specifically designed for photonic implementations of neural networks,
a silicon-based CMOS chip has recently been designed and fabricated [Asthana,
1990a; Asthana, 1990b] that incorporates two input detectors, control circuitry, and two
(optical modulator) output drivers within each 100 x 100 µm pixel, as shown schematically
in Figure 15.7(a). It should be noted that these current dimensions do not in any sense
represent a lower limit, but rather a practical size for laboratory demonstrations and experiments,
as well as a useful size from the perspective of neural network applications. The
pixel layout allows for two 30 x 50 µm detectors, followed by a 15 transistor dual input,
dual output differential amplifier that implements a sigmoid-like transfer function, with
externally programmable saturation characteristics. Output pads are also provided for
hybrid bonding (by bump contact techniques [Shirouzu, 1986]) to an InGaAs/GaAs multiple
quantum well modulator structure fabricated on a GaAs substrate [Kyriakakis, 1990].
Utilizing 2 µm CMOS design rules, the control circuitry easily fits within 25 x 100 µm,
leaving adequate space for the modulator output pads as shown in Figure 15.7(b, c). This
currently allows for the integration of $10^4$ pixels per cm$^2$, or $6 \times 10^4$ pixels per in$^2$. The
functional operation of this circuit will be discussed in the section, “An Implementation
Strategy”, below.
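
The pixel densities quoted above follow directly from the 100 µm pixel pitch, as the short calculation below illustrates. The function names are hypothetical, and the area budget is simple bookkeeping of the stated rectangle sizes that ignores any spacing required by the design rules.

```python
CM_PER_INCH = 2.54

def pixels_per_cm2(pitch_um):
    """Pixels per square centimeter for a square pixel of the given pitch."""
    pixels_per_cm = 1.0e4 / pitch_um  # 1 cm = 10^4 micrometers
    return pixels_per_cm ** 2

def pixels_per_in2(pitch_um):
    """Pixels per square inch for a square pixel of the given pitch."""
    return pixels_per_cm2(pitch_um) * CM_PER_INCH ** 2

density_cm2 = pixels_per_cm2(100.0)   # 1.0e4 pixels per cm^2
density_in2 = pixels_per_in2(100.0)   # about 6.45e4 pixels per in^2

# Area bookkeeping within one pixel, in square micrometers.
pixel_area = 100 * 100
detector_area = 2 * (30 * 50)   # two 30 x 50 um detectors
control_area = 25 * 100         # control circuitry footprint
remaining_area = pixel_area - detector_area - control_area
```

With these figures, the detectors and control circuitry together occupy a little over half of the pixel, which is consistent with the statement that adequate space remains for the modulator output pads.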


FIGURE  15.7  VLSI  layout  of  a  generalizable  silicon-based  spatial  light 
modulator  structure:  (a)  neuron  pixel  layout;  (b)  photograph  of  a  single  neu¬ 
ron  unit  in  VLSI  implementation,  with  probe  pads  substituted  for  the  two 
detectors  (bottom)  and  for  contact  to  the  two  modulation  elements  (top);  (c) 
photograph  of  a  6  x  6  array  of  neuron  units  on  a  VLSI  chip  that  incorporates 
additional  test  circuitry. 


Before leaving the subject of photonic switching, it should be noted that the general
principles outlined above can be used to design a wide variety of mutually compatible
devices with different functionalities as well as different tradeoffs among the set of configurational
and operational parameters. For example, it is relatively straightforward to
design time-integrating and time-differentiating circuits; sharp (step-like) thresholds; level
slices; sigmoid-like functions, their complements, and their derivatives; inverters; and logarithmic
amplifiers. Many such functions can be implemented with only a few integrated
components, such as capacitors, diodes, transistors employed as current amplifiers, and biased
transistors employed as resistors [Mead, 1989]. Therefore, these functions can easily
be incorporated within each pixel (neuron unit) of a two-dimensional spatial light modulator,
as well as in some cases between pixels for the implementation of non-local (other
than pointwise) operations such as automatic gain control and nearest-neighbor inhibition.

Photonic Interconnections

Given that the neuron units are to be represented by individual pixels within a two-dimensional
spatial light modulator, interconnections must now be established between
each individual neuron unit (pixel) and many (if not all) other neuron units. As such,
the chosen interconnection scheme must be capable of the appropriate degree of fan-in
and fan-out, be characterized by sufficient transmission bandwidth in each channel, and
be scalable to relatively large numbers of neuron units. In addition, the neural network
paradigm presents the additional requirement that the interconnections be weighted, such
that the output from a given point-to-point interconnection is proportional to the product
of the input and a stored constant or weight. It is in fact this requirement that eliminates
from consideration a large number of possible switching networks that provide full reconfigurability
in a non-blocking manner (such as a crossbar or shuffle-exchange network),
but without the capacity to incorporate weights within each interconnection pathway.
In adaptive networks (those that incorporate learning algorithms), these interconnection
weights must have the capability of being updated in a manner determined by the particular
learning algorithm employed. A nontrivial consequence of these last two requirements
is that the interconnection weights must be stored for at least as long as the average iterative
computation, if not much longer; yet, they must simultaneously exhibit dynamic
programmability if the network is to exhibit either short-term or long-term plasticity.

For very small numbers of neurons with a low degree of connectivity, one possible
way of forming the interconnection network would be to use fiber optic transmission lines
with modulated semiconductor laser diodes as sources and optical receivers as detectors,
much like a fiber optic local area network. The weights could be incorporated by means
of a variable gain amplifier at either end of each fiber optic link, with weight storage in
local dynamic random access memory (RAM) or static read only memory (ROM) circuits.
Unfortunately, the sheer bulk of each transmitter, receiver, and fiber optic link precludes
scalability  to  large  neural  network  systems.  For  example,  a  fully  connected  twenty  neuron 
network  would  involve  four  hundred  sets  of  sources,  transmission  lines,  and  detectors, 
which  would  currently  represent  a  prohibitive  requirement.  The  same  would  be  true  of  a 
fifty  neuron  network  with  a  fan-out  and  fan-in  of  eight,  representing  a  relatively  low  degree 
of  connectivity. 
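
The link counts quoted above follow from simple arithmetic, sketched below. The function names are illustrative, and the fully connected count assumes that self-connections are included, which is what reproduces the figure of four hundred.

```python
def links_fully_connected(n_neurons):
    """Point-to-point links when every neuron connects to every neuron (itself included)."""
    return n_neurons * n_neurons

def links_with_fan_out(n_neurons, fan_out):
    """Point-to-point links when each neuron drives a fixed number of others."""
    return n_neurons * fan_out

full_20 = links_fully_connected(20)    # 400 source/fiber/detector sets
sparse_50 = links_with_fan_out(50, 8)  # 400 source/fiber/detector sets
```

Both cases require the same number of discrete fiber links, which is why the sparsely connected fifty neuron network is no easier to build this way than the fully connected twenty neuron network.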

In order to be able to satisfy the interconnection requirements for a large number of
neuron units that are fully or nearly fully interconnected, the appropriate photonic technology
to employ is that of holographic interconnections, in which the weights as well as
the  interconnection  patterns  themselves  are  stored  as  holograms  in  either  a  fixed  (static) 
or  real  time  (dynamic)  holographic  recording  material.  In  this  section,  we  first  discuss  the 
basic  principles  that  apply  to  the  utilization  of  holographic  recording  for  point-to-point 
interconnections.  Next,  we  describe  the  physical  origins  of  a  number  of  complexities  with 
holographic interconnection schemes that lead to both interchannel crosstalk and throughput
losses. An architecture that lends itself to the minimization of such complications
will  be  described  in  detail  in  the  section,  “An  Implementation  Strategy”,  below.  Finally, 
the  potential  for  incorporation  of  real  time  volume  holographic  recording  media  such  as 
photorefractive  materials  in  holographic  interconnection  networks  is  addressed. 

The essential principle of holographic recording, that of the space-variant interference
of two mutually coherent wavefronts, was discussed briefly in reference to Equation (10)
and is illustrated in Figure 15.2(b). In this figure, two angularly separated (noncollinear)
collimated beams are incident on a photosensitive material, such that their mutual interference
locally exposes the material to the intensity distribution given by Equation (10).
In Figure 15.2(b), the photosensitive material was assumed to spatially integrate across
the  interference  pattern,  producing  an  output  that  depends  on  only  the  spatial  average 
of  the  intensity  distribution.  Suppose  now  that  we  use  instead  a  photosensitive  material 
with  the  property  that  its  local  index  of  refraction  or  absorption  coefficient  depends  on  the 
local  incident  intensity  (exposure),  which  allows  the  complete  interference  pattern  to  be 
recorded.  The  resulting  change  in  the  local  optical  properties  of  the  medium  may  either 
be immediate (as in the case of a photochromic transformation, for example), or may require
development following exposure (as in the case of bleached photographic negatives
or dichromated gelatin thin films). Figure 15.8(a) shows such a detection or recording
geometry in which a thin semi-transparent layer of photosensitive material acts as a quasi-planar
holographic recording medium. The interference pattern produced by the mutually
coherent  signal  and  reference  beams  within  the  holographic  recording  medium  is  recorded 
to  form  a  diffraction  grating  within  the  volume  accessed  by  both  beams  simultaneously, 
as  shown  in  the  figure.  For  simplicity  in  Figure  15.8  (as  well  as  in  subsequent  figures), 
we  have  not  shown  the  refraction  of  the  incident  and  transmitted  beams  at  the  input  and 
output  faces  of  the  holographic  recording  medium  that  occurs  due  to  a  difference  between 
the  refractive  indices  of  the  medium  and  its  surround.  The  amplitude  (and  intensity)  of 
the  reflected  beam  shown  in  Figure  15.8(b)  depends  directly  on  the  index  difference,  and 
represents  a  throughput  loss  on  readout. 
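
As a rough indication of the size of this reflection loss, the sketch below evaluates the standard Fresnel reflectance for a single lossless interface at normal incidence. This is a simplification, since the beams in Figure 15.8 are obliquely incident, and the index values used are illustrative placeholders rather than data for a particular recording medium.

```python
def fresnel_reflectance_normal(n_medium, n_surround=1.0):
    """Intensity reflectance of a single lossless interface at normal incidence."""
    r = (n_medium - n_surround) / (n_medium + n_surround)  # amplitude reflection coefficient
    return r * r

# Illustrative refractive indices only (assumed values).
loss_low_index = fresnel_reflectance_normal(1.5)   # 0.04, i.e. 4% per surface
loss_high_index = fresnel_reflectance_normal(3.5)  # about 0.31 per surface
matched = fresnel_reflectance_normal(1.0)          # no index difference, no reflection
```

The loss grows quickly with the index difference, which is one reason antireflection coatings or index matching are commonly considered for high-index media.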


FIGURE 15.8 A simplified holographic recording configuration: case of
plane wave signal and reference beams, and a thin holographic recording medium;
(a) recording, and (b) reconstruction with a plane wave readout beam.


Consider first the case of an exposure-dependent refractive index variation. Illumination
of such a space-variant modulation of the refractive index by a coherent collimated beam of
the same wavelength $\lambda$ as the exposure (writing) beams will result in a diffraction pattern
consisting of several collimated beams, each emanating in a characteristic direction as
shown schematically in Figure 15.8(b), and as given by the following equation:

$$k_{mx} = k_{rx} + m K_G; \quad |\mathbf{K}_G| = \frac{2\pi}{\Lambda_G} = |\mathbf{k}_2 - \mathbf{k}_1|; \quad m = 0, \pm 1, \pm 2, \ldots \qquad (15)$$


In this equation, $k_{mx}$ is the x-component of the wave vector of the diffracted beam
(diffraction order), $\mathbf{K}_G$ is the wave vector (assumed oriented along the x-axis) of the
interference pattern (diffraction grating) formed by the two writing beams (with wave
vectors $\mathbf{k}_1$ and $\mathbf{k}_2$), $k_{rx}$ is the x-component of the wave vector of the incident readout
beam propagating in the x-z plane, and $\Lambda_G$ is the spatial wavelength of the diffraction
grating. The multiple diffracted orders result from the phase modulation of the readout
(probe) beam by the refractive index modulation $n(x)$ of the thin holographic grating; the
magnitude and phase of the readout beam amplitude immediately after passing through
the hologram (located at the position $z_0$) can be written in the form:


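$$A_{\mathrm{diff}}(x, z_0) = A_{\mathrm{inc}} \exp[i(k_{rx} x + k_{rz} z_0)] \, \exp[i \Phi_G(x)] \qquad (16)$$

in which $A_{\mathrm{diff}}$ is the amplitude of the diffracted wavefront, $\Phi_G(x) = 2\pi n(x) d / \lambda$ is the local
phase shift induced by the diffraction grating (assumed to be of thickness $d$),
$A_{\mathrm{inc}} \exp[i(k_{rx} x + k_{rz} z_0)]$ is the incident readout beam amplitude at the exit plane of the
hologram ($z_0$), and $k_{rx}$ and $k_{rz}$ are the x and z components of the wave vector $\mathbf{k}_r$,
respectively. Each of the diffracted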
orders  can  then  be  directly  associated  with  a  corresponding  Fourier  component  of  the 
modulated  amplitude  [Goodman,  1968],  which  can  be  expanded  in  terms  of  the  form: 
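
For the common special case of a purely sinusoidal index modulation, the expansion takes a standard closed form. The following is a sketch under that assumption, which is introduced here for illustration; the symbols $n_0$, $n_1$, $\Phi_0$, and $\Phi_1$ are not taken from the original text.

```latex
% Assumed sinusoidal index modulation of the thin grating:
%   n(x) = n_0 + n_1 \cos(K_G x)
% so that the local phase shift is
\Phi_G(x) = \Phi_0 + \Phi_1 \cos(K_G x),
\qquad \Phi_0 = \frac{2\pi n_0 d}{\lambda},
\qquad \Phi_1 = \frac{2\pi n_1 d}{\lambda}.

% Jacobi-Anger expansion of the modulated phase factor:
\exp[i \Phi_1 \cos(K_G x)]
  = \sum_{m=-\infty}^{\infty} i^{m} J_m(\Phi_1) \exp(i m K_G x).

% Each term multiplies the incident factor \exp(i k_{rx} x), so the m-th term
% propagates with x-component k_{rx} + m K_G, in agreement with Equation (15).
```

Under this assumption the amplitude of the $m$th order is proportional to the Bessel function $J_m(\Phi_1)$, so the fraction of the incident power diffracted into that order is $J_m^2(\Phi_1)$.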


In  order  to  assess  the  effectiveness  with  which  the  holographic  grating  diffracts  the  incident 
beam  into  a  particular  diffraction  order,  it  is  convenient  to  define  the  diffraction  efficiency 
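$\eta$ of each order as:

$$\eta_m = \left| \frac{A_m}{A_{\mathrm{inc}}} \right|^2 \qquad (18)$$

in which $A_m$ is the amplitude of the $m$th diffracted order and $A_{\mathrm{inc}}$ is the amplitude of the
incident readout beam.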


The essential diffraction properties of thin absorption gratings (in which the modulation
occurs in the local absorption coefficient) are the same as for the case of thin pure phase
gratings, with two principal exceptions: (1) for sinusoidal absorption gratings, the diffracted
orders are limited to $m = -1$, 0, and 1; and (2) the presence of absorption significantly
decreases the maximum first order diffraction efficiency that can be achieved.
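
The difference between the two grating types can be made concrete with a short numerical sketch. It assumes a thin sinusoidal phase grating, for which the efficiency of order $m$ is the squared Bessel function $J_m^2(\Phi_1)$ of the peak phase modulation $\Phi_1$, and a thin grating with sinusoidal amplitude transmittance $t(x) = t_0 + t_1 \cos(K_G x)$ as a stand-in for the absorption case. Both models, and all function names, are assumptions introduced here for illustration.

```python
import math

def bessel_j(order, x, steps=2000):
    """Integer-order Bessel function from the integral representation
    J_m(x) = (1/pi) * integral over [0, pi] of cos(m*tau - x*sin(tau)),
    evaluated with the midpoint rule."""
    h = math.pi / steps
    total = 0.0
    for k in range(steps):
        tau = (k + 0.5) * h
        total += math.cos(order * tau - x * math.sin(tau))
    return total * h / math.pi

def phase_grating_efficiency(order, phi1):
    """Efficiency of one order of a thin sinusoidal phase grating
    (absorption and surface reflections ignored)."""
    return bessel_j(order, phi1) ** 2

def amplitude_grating_first_order(t0, t1):
    """First-order efficiency for amplitude transmittance t0 + t1*cos(K x),
    which must remain within [0, 1] for a passive grating."""
    if t0 - t1 < 0.0 or t0 + t1 > 1.0:
        raise ValueError("transmittance must stay within [0, 1]")
    return (t1 / 2.0) ** 2

# Scan the phase modulation depth for the largest first-order efficiency.
best_phi, best_eta = max(
    ((0.01 * k, phase_grating_efficiency(1, 0.01 * k)) for k in range(0, 401)),
    key=lambda pair: pair[1],
)
# All orders together carry the full incident power for a pure phase grating.
total_power = sum(phase_grating_efficiency(m, 1.0) for m in range(-10, 11))
# Deepest allowed amplitude modulation: t0 = t1 = 1/2.
best_absorption = amplitude_grating_first_order(0.5, 0.5)
```

In this model the first-order efficiency of the phase grating peaks at roughly 34 percent, whereas the amplitude grating cannot exceed 1/16 (6.25 percent), which illustrates the penalty associated with absorption.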

In order to illustrate how such holographic gratings can be employed to generate weighted
point-to-point interconnections, we need to introduce two additional concepts: the lens
as  an  angle-to-position  encoder,  and  the  superposition  of  holographic  gratings  recorded 
with  different  diffraction  efficiencies.  The  first  concept  can  be  understood  with  reference 
to  Figure  15.9,  in  which  a  simple  lens  is  placed  one  focal  length  away  from  a  point  source  in 
the  input  plane  of  a  photonic  interconnection,  and  a  second  simple  lens  is  placed  one  focal 
length  away  from  the  output  plane.  What  is  normally  thought  of  as  the  focal  property 
of  a  lens  results  in  the  generation  of  a  collimated  beam  (a  beam  comprising  both  parallel 
rays  and  planar  wavefronts)  following  the  first  lens,  with  an  angle  (both  in  and  out  of  the 
plane  of  the  page)  that  depends  on  the  position  of  the  point  source  in  the  focal  (input) 
plane. In this sense, the first lens acts as a position-to-angle encoder, providing a one-to-one
correspondence between the input location and the output collimated beam angle.
Depending on the nature of the grating stored within the holographic optical element,
the collimated beam will be diffracted into a new direction characterized by a different
angle. The second lens will then focus the diffracted beam to a point in the output plane
that depends on this angle, thus acting as an angle-to-position encoder. The utilization of
different orientations of gratings within the holographic optical element allows for the interconnection
of any arbitrary point in the input plane to any other point in the output plane.
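
The position-to-angle and angle-to-position roles of the two lenses can be sketched numerically. The sketch below is illustrative only: it assumes ideal thin lenses of equal focal length, a thin grating in air with its grating vector along x, and the sign convention that a source at +x produces a beam travelling toward -x. The grating relation used, $\sin\theta_m = \sin\theta_r + m\lambda/\Lambda_G$, is Equation (15) divided by $2\pi/\lambda$. All function names and numerical values are hypothetical.

```python
import math

def input_position_to_angle(x_in, focal_length):
    """First lens: point source at x_in in the front focal plane -> beam angle (radians)."""
    return math.atan2(-x_in, focal_length)

def diffract(theta_in, wavelength, grating_period, order=1):
    """Thin-grating equation; returns None if the order does not propagate."""
    s = math.sin(theta_in) + order * wavelength / grating_period
    if abs(s) > 1.0:
        return None
    return math.asin(s)

def angle_to_output_position(theta_out, focal_length):
    """Second lens: collimated beam angle -> focused position in the back focal plane."""
    return focal_length * math.tan(theta_out)

def interconnect(x_in, focal_length, wavelength, grating_period, order=1):
    """Map an input position to an output position through lens, grating, lens."""
    theta_in = input_position_to_angle(x_in, focal_length)
    theta_out = diffract(theta_in, wavelength, grating_period, order)
    if theta_out is None:
        return None
    return angle_to_output_position(theta_out, focal_length)

# Illustrative values, all lengths in millimeters.
f_mm, wavelength_mm, period_mm = 100.0, 0.5e-3, 10.0e-3
x_first_order = interconnect(0.0, f_mm, wavelength_mm, period_mm, order=1)
x_zero_order = interconnect(1.0, f_mm, wavelength_mm, period_mm, order=0)
```

With these values the on-axis input is routed about 5 mm off axis in the first order, while the undiffracted (zero) order simply images the input plane with inversion. Choosing a different grating period or orientation moves the first-order output point.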


FIGURE 15.9 A point-to-point interconnection system, using a holographic
optical element (HOE) for interconnection routing, and lenses as position-to-angle
and angle-to-position encoders. In this example, the holographic optical
element effectively performs an input angle to output angle transformation,
such that light emitted (or transmitted) at point $p_i$ in the input plane ($P_i$) is
detected at point $p_j$ in the output plane ($P_j$).


Suppose  now  that  we  do  in  fact  choose  to  superimpose  a  number  of  planar  gratings 
within the holographic medium, each with a different wave vector (orientation and grating
period) and grating modulation (variation of the refractive index or the absorption
coefficient).  Assuming  for  the  moment  that  the  diffraction  process  is  linear,  each  input 
point  will  be  interconnected  with  a  number  of  output  points  as  determined  by  the  set 
of  recorded  gratings.  Likewise,  each  output  point  will  be  interconnected  with  a  specified 
number  of  input  points.  Each  interconnection  will  be  weighted  by  its  diffraction  efficiency 
as determined by Equation (18), which is in turn dependent on the index of refraction (or
absorption  coefficient)  variation  recorded  for  each  grating.  As  such,  the  holographic  optical 
element  acts  as  a  multi-port  variable  beamsplitter,  redirecting  (diffracting)  a  given  fraction 
of  each  input  beam  to  a  specified  set  of  output  beams.  By  employing  lenses  as  described 
above,  this  feature  allows  the  construction  of  a  point-to-point  interconnection  with  weights 
and arbitrary fan-in/fan-out (limited only by the number of gratings recorded).
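
Under the linearity assumption stated above, the multi-port beamsplitter picture reduces to a matrix of diffraction efficiencies acting on the input intensities. The sketch below is a minimal illustration: it additionally assumes mutually incoherent input beams, so that intensities rather than amplitudes add at each output, and the efficiency values and function names are hypothetical.

```python
def apply_interconnections(efficiencies, inputs):
    """efficiencies[i][j] is the diffraction efficiency (weight) from input j to output i."""
    return [sum(eta * x for eta, x in zip(row, inputs)) for row in efficiencies]

def fan_out(efficiencies, j):
    """Number of outputs reached from input j (nonzero weights in column j)."""
    return sum(1 for row in efficiencies if row[j] > 0.0)

def fan_in(efficiencies, i):
    """Number of inputs feeding output i (nonzero weights in row i)."""
    return sum(1 for eta in efficiencies[i] if eta > 0.0)

def is_passive(efficiencies):
    """A passive element cannot diffract more than all of the power in any input beam."""
    n_inputs = len(efficiencies[0])
    return all(sum(row[j] for row in efficiencies) <= 1.0 for j in range(n_inputs))

# Hypothetical weights for 3 inputs and 2 outputs.
efficiencies = [[0.10, 0.00, 0.05],
                [0.20, 0.30, 0.00]]
inputs = [1.0, 2.0, 4.0]
outputs = apply_interconnections(efficiencies, inputs)
```

The passivity check reflects the fact that the weights are diffraction efficiencies: the fractions of any one input beam redirected to all outputs cannot sum to more than unity.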

There is at least one obvious problem with the interconnection scheme outlined above,
however, in that any given grating will connect any of the input points to specific output
points pairwise, as shown by Equation (15). This particular feature occurs because each
input point generates a collimated beam with a distinct wave vector $k$, corresponding to
a particular direction (angle) of propagation, each of which satisfies Equation (15) with a
different diffracted wave vector (for each diffracted order) $k_m$. The result of this degeneracy
is that any recorded hologram that is designed to connect a single input point to one or
more output points will in fact also connect every other input point to corresponding sets
of output points, using the same relative interconnection pattern for each input point.
This effect can be utilized to advantage, for example, in parallel digital optical computing
systems with interconnection symmetry or regularity, since one simple hologram can in
effect implement a very large number of point-to-point interconnections (the equivalent of
wires in the case of an electronic implementation) [Jenkins, 1984]. For neural networks,
however, the common requirement of nearly arbitrary (highly irregular) interconnections
makes this feature undesirable.
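
The shift-invariant behavior described above can be illustrated numerically. The following is a minimal sketch, assuming that Equation (15) (not reproduced in this section) reduces for a thin grating to the standard grating equation, sin(theta_out) = sin(theta_in) + m * wavelength / period. The function name and all numerical values are illustrative assumptions, not taken from the text.

```python
import math

def diffracted_angle(theta_in_deg, wavelength_um, period_um, order=1):
    """Thin-grating output angle from sin(theta_out) = sin(theta_in) + m*lambda/period.

    Returns None when the requested order is evanescent (|sin| > 1).
    """
    s = math.sin(math.radians(theta_in_deg)) + order * wavelength_um / period_um
    if abs(s) > 1.0:
        return None
    return math.degrees(math.asin(s))

# Hypothetical values: 0.633 um readout light, 5 um grating period.
WAVELENGTH_UM = 0.633
PERIOD_UM = 5.0

# Each input point maps (through the first lens) to a distinct input angle.
input_angles_deg = [-2.0, -1.0, 0.0, 1.0, 2.0]
output_angles_deg = [
    diffracted_angle(a, WAVELENGTH_UM, PERIOD_UM) for a in input_angles_deg
]

# In sine space every input is shifted by the same amount, so a single thin
# grating connects every input point to its own, identically displaced output.
sine_shifts = [
    math.sin(math.radians(out)) - math.sin(math.radians(inp))
    for inp, out in zip(input_angles_deg, output_angles_deg)
]

for inp, out in zip(input_angles_deg, output_angles_deg):
    print(f"input {inp:+5.1f} deg -> first-order output {out:+7.3f} deg")
```

The constant shift in sine space is the degeneracy discussed above: useful for regular interconnection patterns, but incompatible with independently chosen connections.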


A second problem with the proposed interconnection scheme is the presence of a multiplicity
of diffracted orders for each diffraction grating, as shown in Figure 15.8, which
occasions the connection of each input point to a number of geometrically related output
points even for the case of a single stored grating.

The  solution  to  this  seeming  dilemma  is  to  extend  the  holographic  medium  into  the 
third  dimension  (the  direction  of  light  propagation),  creating  a  volume  holographic  optical 
element  (VHOE)  to  take  the  place  of  the  thin  planar  element  discussed  above.  There  are 
two  essential  properties  of  VHOEs  that  bear  directly  on  the  utilization  of  such  elements  in 
photonic  interconnections.  The  first  is  that  diffraction  is  limited  to  the  first  order  only  and 
all  higher  diffracted  orders  are  suppressed  if  the  holographic  medium  is  thick  enough,  as 
defined below and as shown schematically in Figure 15.10. This occurs because each additional
“layer” in the thickness direction of the holographic medium provides an additional
constraint  on  the  diffraction  phenomenon;  these  constraints  act  collectively  to  enhance 
the  amplitude  diffracted  into  the  first  order  by  means  of  constructive  interference,  at  the 
expense  of  the  other  diffracted  orders. 


FIGURE 15.10 Volume holographic recording with plane wave signal and
reference  beams;  (a)  recording,  and  (b)  reconstruction,  showing  the  elimination 
of  the  higher  diffracted  orders. 


The second important property of a volume holographic optical element is that of
angular selectivity: specifically, the range of input angles that can diffract from a given
grating decreases as the thickness of the grating is increased. The central angles that are
allowed in the case of a thick grating are the same angles that define the two beams that
initially created the holographic grating. This property therefore eliminates the inadvertent
connection of all input points pairwise to a matching set of output points, and allows for
the generation of independent, weighted interconnections as are desired for neural network
applications.

In order to differentiate “thin” grating diffraction behavior (the so-called Raman-Nath
diffraction regime) from “thick” grating behavior (the so-called Bragg diffraction regime),
it is convenient to define a dimensionless “thickness” parameter $Q$ such that:


$$Q = \frac{2\pi\lambda d}{n\Lambda^{2}} \qquad (19)$$


in which $n$ is the average refractive index of the holographic recording medium, and the
remaining parameters are as specified previously. In general, gratings for which $Q > 10$
operate well within the Bragg regime, while gratings with $Q$ parameters less than unity
exhibit unacceptable degrees of Raman-Nath character for truly independent multiplexed
interconnection applications. The angular response characteristics of both planar and volume
diffraction gratings are shown as a function of the $Q$ parameter in Figure 15.11, in
which the transition from pure Raman-Nath to pure Bragg behavior for increasing values
of $Q$ can be seen. Note that the number of input points that can be independently connected
to an equally-sized array of output points is a decreasing function of the width of
the angular response.
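
As a rough numerical illustration of Equation (19), the sketch below evaluates Q for a few thicknesses and applies the thresholds quoted above. It assumes the usual reading of the symbols (readout wavelength, grating thickness, average refractive index, grating period). The function names and all numerical values are hypothetical.

```python
import math

def q_parameter(wavelength_um, thickness_um, index, period_um):
    """Dimensionless thickness parameter Q = 2*pi*lambda*d / (n * period**2)."""
    return 2.0 * math.pi * wavelength_um * thickness_um / (index * period_um ** 2)

def diffraction_regime(q):
    """Classify a grating using the rule-of-thumb thresholds quoted in the text."""
    if q > 10.0:
        return "Bragg"
    if q < 1.0:
        return "Raman-Nath"
    return "intermediate"

# Hypothetical example: 0.633 um light, n = 1.5, 2 um grating period.
for thickness_um in (0.5, 5.0, 20.0):
    q = q_parameter(0.633, thickness_um, 1.5, 2.0)
    print(f"d = {thickness_um:5.1f} um  Q = {q:6.2f}  regime: {diffraction_regime(q)}")
```

Because Q scales linearly with thickness and inversely with the square of the grating period, fine gratings reach the Bragg regime in much thinner media than coarse ones.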


FIGURE  15.11  The  angular  alignment  sensitivity  of  a  volume  holographic 
optical  element,  as  a  function  of  the  dimensionless  Q-parameter  defined  in  the 
text.  The  grating  strength  for  all  of  the  curves  (3.14  radians)  is  optimized  to 
produce  100%  diffraction  efficiency  in  the  limit  of  large  Q  (Bragg  diffraction 
regime),  and  is  not  optimized  for  low  Q  gratings.  Note  that  the  diffraction 
efficiency  is  essentially  independent  of  angle  for  low  Q  gratings,  and  is  very 
strongly peaked at the Bragg angle (7.5 degrees in this case) for high Q gratings.


The throughput efficiency of a volume holographic optical element as used in an interconnection
application is determined to first order by the diffraction efficiency of each
individual interconnection grating, in direct analogy to the definition of the diffraction
efficiency for the planar hologram case in Equation (18). For example, for the case of an
unslanted pure phase grating with equiphase fronts (i.e., planes of constant phase) parallel
to the bisector of the recording beams with wave vector $k$ and perpendicular to the entrance
face of the volume holographic recording medium, the diffraction efficiency at the
Bragg (optimum readout) angle is given by [Kogelnik, 1969]:

$$\eta = \exp\!\left(-\frac{\alpha d}{\cos\theta_B}\right)\sin^{2}\!\left(\frac{\pi\,\Delta n\,d}{\lambda\cos\theta_B}\right) \qquad (20)$$


in which $\alpha$ is the absorption coefficient of the holographic recording medium of thickness
$d$ at the optical readout wavelength $\lambda$, $\Delta n$ is the amplitude of the refractive index
modulation, and $\theta_B$ is the Bragg angle defined by $2k\sin\theta_B = K_g$. As can be seen from Equation
(20), the diffraction efficiency of the first order for a single grating can approach 100% if
the absorption coefficient satisfies the requirement $\alpha d \ll 1$, provided sufficient index
modulation $\Delta n$ can be produced by the exposure process. The dependence of the diffraction
efficiency on the grating strength is shown in Figure 15.12 for both thin (Raman-Nath)
and thick (Bragg) pure phase diffraction gratings. The grating strength $\nu$ is defined as the
integrated peak phase modulation of the grating in each case, and is given by:

$$\nu = \frac{2\pi\,\Delta n\,d}{\lambda\cos\theta_B} \qquad (21)$$

The maximum diffraction efficiency of the thin diffraction grating is about 34%, which
occurs at a grating strength of 1.8 radians. Thick diffraction gratings achieve 100% diffraction
efficiency at a grating strength of $\pi$ radians, at which point the diffraction efficiency
of the thin grating has peaked and is nearly at its first node, as shown in Figure 15.12.

FIGURE  15.12  The  diffraction  efficiency  of  thin  (Raman-Nath  diffraction 
regime)  and  thick  (Bragg  diffraction  regime)  holographic  gratings  as  a  function 
of  the  grating  strength. 
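
The numbers quoted for Figure 15.12 can be reproduced with a short calculation. The sketch below assumes the standard limiting forms, which are not written out in the text: a first-order efficiency of J1(v)^2 for a thin sinusoidal phase grating, and sin^2(v/2) for a lossless thick phase grating read out at the Bragg angle. The Bessel function is evaluated from its integral representation so that only the standard library is needed. All names are illustrative.

```python
import math

def bessel_j1(x, steps=400):
    """First-order Bessel function from Bessel's integral,
    J1(x) = (1/pi) * integral_0^pi cos(t - x*sin(t)) dt, by the trapezoidal rule."""
    h = math.pi / steps
    total = 0.5 * (math.cos(0.0) + math.cos(math.pi - x * math.sin(math.pi)))
    for i in range(1, steps):
        t = i * h
        total += math.cos(t - x * math.sin(t))
    return total * h / math.pi

def thin_efficiency(v):
    """Assumed Raman-Nath first-order efficiency for grating strength v (radians)."""
    return bessel_j1(v) ** 2

def thick_efficiency(v):
    """Assumed lossless Bragg-regime efficiency for grating strength v (radians)."""
    return math.sin(v / 2.0) ** 2

# Scan the grating strength from 0 to 4 radians in steps of 0.01.
strengths = [0.01 * i for i in range(401)]
peak_strength = max(strengths, key=thin_efficiency)
peak_thin = thin_efficiency(peak_strength)

print(f"thin grating peak: {peak_thin:.3f} at v = {peak_strength:.2f} rad")
print(f"at v = pi: thick = {thick_efficiency(math.pi):.3f}, "
      f"thin = {thin_efficiency(math.pi):.3f}")
```

Under these assumptions the thin grating peaks near 34% close to 1.8 radians, while the thick grating reaches unity at a grating strength of pi, in agreement with the values stated above.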


The extremely narrow angular alignment characteristics of volume diffraction gratings
in principle allow large numbers of independent, weighted interconnections to be
multiplexed simultaneously between the input plane and the output plane (cf. Figure
15.9). In addition, the use of angular multiplexing allows for both fan-out from a given
input  point  to  a  number  of  output  points,  as  well  as  fan-in  from  a  number  of  input  points 
to  a  single  output  point. 

The holographic implementation of the fan-out from a single input point to a number
of output points uses several multiplexed (superimposed) holographic gratings to achieve
the desired weighted fan-out, one for each output point. Consider a 4 input, 4 output
interconnection as shown in Figure 15.13. For each input point $x_j$ that we wish to interconnect
to an output point $y'_i$, the recording process requires the pairwise coherent interference
within the holographic recording medium of $x_j$ with a second beam $y_i$ corresponding to
$y'_i$. The interconnection of $x_1$ to $y'_1$, $y'_2$, $y'_3$, and $y'_4$ therefore requires the pairwise coherent
interference of $x_1$ with $y_1$, $x_1$ with $y_2$, and so on. This process results in the fourfold fan-out
of $x_1$ to all of the outputs.

The  fan-out  from  a  single  reference  beam  to  a  number  of  output  beams  is  directly 
analogous  to  the  readout  of  a  traditional  hologram  (of,  for  example,  a  two-dimensional  or 
three-dimensional image), provided that the full set of beams $\{y_i\}$ is coherently recorded
with the given reference beam $x_j$. Although up to this point we have formulated the
point-to-point holographic interconnection problem in terms of collimated (plane wave) input
and output beams that record individual diffraction gratings (characterized by a single
grating  wave  vector)  within  the  holographic  recording  medium,  many  alternative  recording 
and  reconstruction  geometries  can  be  envisioned  that  produce  equivalent  results.  In  the 
case  of  traditional  holography,  for  example,  the  input  transparency  bearing  the  image  to  be 
recorded  is  illuminated  with  a  collimated  beam,  resulting  in  a  complex  diffraction  pattern 
at  the  front  entrance  plane  of  the  holographic  recording  medium.  Collimated,  converging, 
or  diverging  reference  beams  can  be  utilized  to  produce  reconstructed  images  with  a  wide 
variety of optical imaging characteristics. Likewise, various input and output beam geometries
can be used in a point-to-point interconnection system to optimize the overall system
characteristics,  such  as  freedom  from  interchannel  crosstalk,  optimum  use  of  the  spatial 
frequency  recording  characteristics  of  the  holographic  recording  medium,  optical  system 
complexity,  and  convenience  of  the  optical  layout  (particularly  when  viewed  in  conjunction 
with  associated  optical  subsystems). 


FIGURE 15.13 Schematic representation of a 4 input, 4 output holographic
interconnection, showing 4 coherent input beams $x_1$-$x_4$ and 4 coherent recording
beams $y_1$-$y_4$, each of which corresponds to a desired output $y'_i$. In (a), the
sets $\{x_j\}$ and $\{y_i\}$ interfere within the volume holographic medium, recording
the desired interconnection diffraction gratings. In (b), a new set of input beams
$\{x_j\}$ illuminates the volume holographic medium, reading out the weighted
interconnection pattern and forming appropriately weighted sums at each of
the outputs $\{y'_i\}$.


The fourfold fan-in of inputs $x_1$, $x_2$, $x_3$, and $x_4$ to $y'_1$ can likewise be accomplished by
recording each of the necessary interconnections pairwise, as before for the fan-out case.
The recording process for the fully implemented 4 to 4 interconnection therefore involves
the generation of 16 individually weighted diffraction gratings that connect the full set of
inputs $\{x_j\}$ to the full set of outputs $\{y'_i\}$. The multiplexed hologram that accomplishes
this  function  can  be  recorded  in  a  number  of  ways,  each  characterized  by  certain  advantages 
and  disadvantages  [Psaltis,  1988]. 

In the fully coherent approach, the requisite gratings can be recorded by illuminating
the holographic recording medium with $\{x_j\}$ and with $\{y_i\}$ simultaneously. This can be
accomplished, for example, by using a spatial light modulator to store each of the sets of
values, and a pair of mutually coherent readout beams to encode these values and interfere
them within the holographic element. In this manner, all of the required gratings are
recorded in a single exposure; however, there are two difficulties inherent in this single
exposure, fully coherent approach. The first problem is that a fully independent $N$ to $M$
interconnection requires $NM$ stored interconnection weights, whereas the single exposure
described above supplies only $N + M$ input values that can be used to generate the weights.
The resulting interconnection matrix can in fact connect all of the input points to all of the
output points, but the relative fan-out weights from each input point will be degenerate.
One way to avoid this degeneracy is to illuminate the holographic recording medium with
each input $x_j$ and a full set of corresponding outputs $\{y_i\}$, sequencing through all $N$ of
the inputs (and changing the set of corresponding outputs) one at a time. This procedure
generates an independent fan-out from each input point. The second problem with the
single exposure, fully coherent approach is that undesirable gratings will be recorded among
the $\{x_j\}$ and among the $\{y_i\}$ that can lead to considerable coherent crosstalk among the
desired interconnection pairs. This coherent interference process diminishes the degree of
independence of the interconnections.
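
The weight degeneracy of the single exposure can be made concrete with a small sketch. It assumes, as an idealization not stated in the text, that each recorded grating strength is simply proportional to the product of the two recording amplitudes, so that a single exposure stores the outer product W[i][j] = y[i] * x[j]. Unwanted gratings, erasure, and saturation are ignored. All names and values are hypothetical.

```python
def single_exposure_weights(x, y):
    """Idealized single-exposure weights: W[i][j] = y[i] * x[j] (an outer product)."""
    return [[yi * xj for xj in x] for yi in y]

def sequential_weights(output_sets):
    """Idealized sequential recording: exposure j pairs a unit-amplitude input j
    with its own output set, so column j of W equals output_sets[j]."""
    n_inputs = len(output_sets)
    n_outputs = len(output_sets[0])
    return [[output_sets[j][i] for j in range(n_inputs)] for i in range(n_outputs)]

def fanout_pattern(weights, j):
    """Relative fan-out weights from input j, normalized to the first output."""
    column = [row[j] for row in weights]
    return [w / column[0] for w in column]

x = [1.0, 0.5, 2.0, 0.25]   # N = 4 input amplitudes
y = [1.0, 0.8, 0.6, 0.4]    # M = 4 recording-beam amplitudes
w_single = single_exposure_weights(x, y)

# N + M = 8 values were supplied, so every input sees the same relative
# fan-out pattern: the 16 weights are not independent.
patterns_single = [fanout_pattern(w_single, j) for j in range(len(x))]

# Sequencing through the inputs with a different output set each time
# gives each input its own, independent fan-out pattern.
output_sets = [
    [1.0, 0.8, 0.6, 0.4],
    [1.0, 0.1, 0.9, 0.3],
    [1.0, 0.5, 0.5, 0.7],
    [1.0, 0.2, 0.4, 0.6],
]
w_sequential = sequential_weights(output_sets)
patterns_sequential = [
    fanout_pattern(w_sequential, j) for j in range(len(output_sets))
]
```

In this idealization the single exposure yields a rank-one weight matrix, which is the degeneracy described above, while sequencing through the N inputs fills in all NM weights independently.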

This coherent-recording-induced crosstalk can also be avoided by sequencing the recording,
but in this case each desired grating pair is recorded separately such that only one
input beam $x_j$ interferes with one output beam generator $y_i$ (the recording beam for the desired
output beam $y'_i$) at a time. This scheme effectively eliminates the coherent crosstalk, but
does not eliminate another form of crosstalk (called beam degeneracy crosstalk [Jenkins,
1990a; Jenkins, 1990b; Asthana, 1990c], the origin of which is described below) that can
be equally severe; in addition, the complication imposed by the incorporation of such a
sequential recording schedule can be a serious constraint for large $N \times M$ ($N$ input points
to $M$ output points) interconnections, as $NM$ independent recording steps are required
for full programming of the interconnection. This proves to be particularly problematic
for the rapid generation of weight updates in a large scale neural interconnection network
that incorporates synaptic plasticity. Furthermore, sequential recording of holographic
exposures can cause partial erasure of previously recorded interconnection weights in certain
types of holographic recording materials, necessitating the use of recording schedules
that attempt to balance the weights recorded at the beginning of the sequence (and hence
partially erased by all subsequent exposures) with the weights recorded at the end of the
sequence [Psaltis, 1988]. The use of such recording schedules usually implies an overall
decrease in both the exposure efficiency and throughput efficiency of the resulting
holographically recorded interconnection matrix, as well as the buildup of noise resulting from
the series of space-variant erasures.

One potential scheme for reducing coherent-recording-induced crosstalk, beam degeneracy
crosstalk, and sequential recording schedules involves the use of an array of individually
coherent but mutually incoherent sources to simultaneously expose the holographic recording
medium to only the desired sets of gratings [Jenkins, 1990a; Jenkins, 1990b; Asthana, 1990c].
This scheme will be discussed in detail in a later section.

The fan-out process is illustrated in Figure 15.14, in which implementations using both
beamsplitters and volume holographic optical elements are shown. The case of fan-out
utilizing beamsplitters is shown schematically in Figure 15.14(a). As can be seen in the
figure, the input beam can be divided among the output beams with arbitrary weights set
by the transmissivities of the beamsplitting elements $BS_i$. If the final beamsplitter is a
mirror, the fan-out process can be accomplished with essentially zero throughput loss. By
analogy to the beamsplitter case, as well as by direct analysis, it can be proven that the
holographic fan-out process shown in Figure 15.14(b) can also be accomplished with essentially
arbitrary weights, with no optical throughput loss inherent in the fan-out process
itself. It is interesting to note that these two implementations differ in at least one essential
feature, in that the beams fanned out from the holographic implementation originate
within the same volume, while the beams fanned out from the beamsplitter implementation
originate from vertically displaced beamsplitters. If we were to extend the fanned out
beams in the latter case backwards toward the left hand side of Fig. 15.14(a), we could
imagine replacing the three discrete, vertically displaced beamsplitters with a single, multiplexed
“virtual” beamsplitter that generates the same set of output beams. One physical
realization of such a “virtual” beamsplitter component is in fact the multiplexed volume
hologram shown in Figure 15.14(b).
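
The lossless beamsplitter fan-out can be checked with a short calculation. The sketch below assumes an idealized cascade in which each lossless element splits off a fraction of the power reaching it into one output and passes the remainder to the next element, with the last element acting as a mirror. Whether the split-off beam is the reflected or the transmitted one depends on the layout of Figure 15.14(a); the code simply calls this fraction the "tap". The function names and weights are illustrative.

```python
def cascade_taps(weights):
    """Tap fractions that deliver the requested output power fractions.

    weights: desired output power fractions, summing to 1.
    Element i sees the power left over by the earlier elements, so
    tap_i = w_i / (1 - sum of earlier weights); the last tap is 1 (a mirror).
    """
    taps = []
    remaining = 1.0
    for w in weights:
        taps.append(w / remaining)
        remaining -= w
    return taps

def cascade_outputs(taps, input_power=1.0):
    """Propagate power through the cascade and return the output powers."""
    outputs = []
    remaining = input_power
    for tap in taps:
        outputs.append(remaining * tap)
        remaining *= (1.0 - tap)
    return outputs

weights = [0.5, 0.3, 0.2]        # one input, three outputs
taps = cascade_taps(weights)     # approximately [0.5, 0.6, 1.0]
outputs = cascade_outputs(taps)

print("tap fractions:", [round(t, 6) for t in taps])
print("output powers:", [round(p, 6) for p in outputs])
```

The outputs sum to the input power for any set of weights that sums to one, which is the sense in which the fan-out itself carries no intrinsic throughput loss.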


FIGURE 15.14 Schematic representation of the fan-out process for optical
beams, for the case of one input and three outputs: (a) with beamsplitters
($BS_1$-$BS_3$); (b) with a single holographic optical element containing three
multiplexed (spatially superimposed) diffraction gratings.


The collinear fan-in process is illustrated in Figure 15.15 for both types of implementations.
As was discussed above, for the beamsplitter implementation an intrinsic fan-in
loss is encountered for the case of collinear fan-in, while the intrinsic loss can be circumvented
by resorting to mirrors and employing angular multiplexing. For the case of volume
holographic optical elements, the situation is identical, such that collinear fan-in is grossly
inefficient for large numbers of fan-in interconnections to the same node. On the other
hand, appropriate use of angular multiplexing can eliminate this seemingly inherent fan-in
loss, giving rise to a highly multiplexed, efficient interconnection element [Jenkins, 1990a;
Jenkins, 1990b; Asthana, 1990c] as described in a later section.


FIGURE 15.15 Schematic representation of the fan-in process for optical
beams, for the case of three angularly distinct inputs and one combined collinear
output beam: (a) with beamsplitters, showing the unavoidability of a throughput
loss associated with the set of transmitted (and multiply reflected) beams;
(b) with a single holographic optical element containing three multiplexed
(spatially superimposed) diffraction gratings, showing an analogous throughput
loss.


The physical origin of this intrinsic optical throughput loss in the case of collinear
fan-in is directly related to the mechanism that gives rise to beam degeneracy crosstalk.
In Figure 15.16 we show a 4 to 4 holographic interconnection that is assumed to have
been recorded by the sequential exposure technique described above in reference to Figure
15.13, in order to include all 16 individually weighted interconnection gratings but none
of the undesirable gratings that can give rise to coherent-recording-induced crosstalk. In
this case, readout by the input beam $x_1$ generates the four output beams $y'_1$ through $y'_4$
with values given by the stored interconnection weights $W_{ij}$:

$$y'_i = W_{i1}\,x_1 \qquad (22)$$

Within the holographic medium, however, each of the four output beams can in turn
act as an input beam, generating undesired output beams in the directions $x'_2$ through
$x'_4$. These undesired output beams are a result of diffraction from the gratings recorded
between each output generating beam $y_i$ and the full set of input beams $\{x_j\}$. Each output
beam is automatically Bragg matched (at the correct Bragg angle) to the full set of input
beams due to the collinear recording geometry employed. We refer to the fan-in as collinear
in this case because each input beam $x_j$ that is fanned in to a given output $y'_i$ produces an
output beam in the same direction. The generation of diffracted intensity in the directions
$x'_2$ through $x'_4$ from readout with $x_1$ results in a throughput loss for the interconnections between
$x_1$ and the set of output beams $\{y'_i\}$. In addition, the throughput losses of the individual
output beams $\{y'_i\}$ will not be equal in general. Furthermore, the undesired diffracted
beams $x'_2$ through $x'_4$ can also act as input beams, generating additional output beams in the
directions $\{y'_i\}$ that coherently interfere with the beams directly diffracted in those directions
by the input beam $x_1$. The combination of interconnection-dependent losses from the output
beams $\{y'_i\}$ into the “cross-coupled” beams $\{x'_j\}$, and of interconnection-dependent
coupling from $\{x'_j\}$ into $\{y'_i\}$ gives rise to an undesired redistribution of the intensities of
the output beams. This phenomenon is referred to as beam degeneracy crosstalk, as it
arises from the beam direction degeneracy (collinearity) of the output beams fanned into
a single output point.
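
For comparison with the crosstalk mechanism just described, the ideal behavior of the interconnection is simply a weighted sum at each output. The sketch below treats the stored weights as abstract linear coefficients, leaving open whether they act on amplitudes or intensities, and ignores throughput loss and beam degeneracy crosstalk entirely. The weight values and the function name are hypothetical.

```python
def ideal_readout(weights, inputs):
    """Ideal weighted interconnection: outputs[i] = sum_j weights[i][j] * inputs[j]."""
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, inputs)) for row in weights]

# Hypothetical 4-to-4 weight matrix (rows: outputs i, columns: inputs j).
W = [
    [0.10, 0.20, 0.05, 0.15],
    [0.30, 0.05, 0.10, 0.05],
    [0.05, 0.25, 0.20, 0.10],
    [0.15, 0.10, 0.25, 0.30],
]

# Single-beam readout with x_1 only, as in Equation (22): the first column of W.
single_beam = ideal_readout(W, [1.0, 0.0, 0.0, 0.0])

# Readout with several inputs at once: each output is a weighted sum (fan-in).
all_beams = ideal_readout(W, [1.0, 0.5, 0.0, 2.0])

print("x_1 only  :", single_beam)
print("all inputs:", [round(v, 6) for v in all_beams])
```

Beam degeneracy crosstalk shows up as a departure from this linear picture: the measured outputs are no longer given by the stored weights alone.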


FIGURE 15.16 Illustration of the generation of crosstalk in holographic
optical interconnections due to beam degeneracy: recording/readout configuration.
The input beams $\{x_j\}$ are assumed to have interfered within the
volume holographic medium with the set of recording beams $\{y_i\}$, producing
the desired set of interconnection gratings with weights $W_{ij}$. Illumination of
the volume holographic medium with beam $x_1$ produces a 1 to 4 fanout into
the output beams $\{y'_i\}$, as well as the zeroth order beam $x'_1$. Due to the effects
of beam degeneracy, power is also coupled into the zeroth order beams $x'_2$-$x'_4$,
and crosstalk terms $\{c_i\}$ are introduced into the outputs.


Both the throughput loss and the beam degeneracy crosstalk that characterize holographic
interconnection geometries with collinear fan-in can be estimated by numerical
simulation of the diffraction process from a multiplexed grating [Asthana, 1990c]. By
using the optical beam propagation method [Johnson, 1986] to simulate the diffraction
process, we can analyze the 4 to 4 interconnection described above for the case of a single
beam readout, as shown in Figure 15.16. The results of such an analysis are presented in
Figure 15.17, which shows the diffraction efficiency of each of the four beams fanned out
from the single input point, as well as the three cross-coupled beams in the directions $\{x'_j\}$
and the zero order (undiffracted) beam. For this illustration, all 16 interconnection weights
are equal in magnitude. As the grating strength is increased, a significant amount of intensity
is coupled into the cross-coupled components, robbing the desired fan-out beams of the
desired diffraction efficiency. In addition, the relative diffraction efficiencies observed in the
designated fan-out beams are no longer independent of the grating strength, as desired in
a fully independent weighted interconnection. Extensive modeling of N-to-N holographic
interconnections with collinear fan-in suggests that the throughput loss increases approximately
as $1/N$, which is potentially catastrophic for large interconnection networks. In a
later section, we will describe an alternative holographic recording approach that obviates
this $1/N$ loss.


FIGURE 15.17 Illustration of the generation of crosstalk in holographic optical
interconnections due to beam degeneracy: diffraction efficiency as a function
of grating strength for the readout configuration of Figure 15.16. Shown
are the depletion of the zero order beam $x'_1$ and the rise of the desired output
beams $y'_i$, accompanied by a strong buildup of the cross-coupled beams $x'_2$-$x'_4$.

The development of a viable photonic interconnection technology is based in no
small part on the availability of appropriate photosensitive recording materials [Psaltis,
1988; Smith, 1977; Gunter, 1988; Gunter, 1989]. Many interconnection demonstration
experiments have been performed in the laboratory on bleached photographic emulsions
and dichromated gelatin films, both of which are thick enough (10-30 µm) to exhibit
sufficient Bragg-like diffraction behavior to allow a limited degree of multiplexing to be
incorporated. Neither material, however, exhibits capacity for real time operation, which
is essential for the implementation of photonic neural networks with at least some degree
of synaptic plasticity. On the other hand, one principal advantage of photographic film
and dichromated gelatin is their essentially infinite read-write asymmetry, which is highly
desirable in many applications as described below.

By real time operation, we mean that the holographic interconnections can be programmed
(exposed) and used (read out) on roughly the same time scale (perhaps at kHz
frame rates), without the necessity of chemical development processes or the like. By
read-write asymmetry, we mean that the readout of a programmed interconnection should not
erase the stored weights at an accumulated readout exposure equal to that of the recording
exposure. Ideally, we would like to have the capability of exposing the holographic
interconnection to the recording beams with essentially instantaneous “development” of the
stored gratings, with the recording process characterized by very high sensitivity during
the “learning” process. At the same time, we would like to be able to initiate readout of
the stored interconnection pattern without altering the stored weights for a length of time
equal to the desired “computation” time. Although in many applications the learning and
computation times may differ by only an order of magnitude, in other cases it is desirable
to compute for very long times compared with the learning phase, and yet still maintain
the capacity for (slowly varying) weight updates.

The class of photosensitive recording materials that has been most extensively investigated
for photonic interconnection applications does in fact have the capacity for sensitive
holographic recording, is available in “thick” samples that allow for the formation of
Bragg-regime diffraction gratings, exhibits a high multiplexing capacity, and allows for the
inclusion of modest read-write asymmetries. This class is that of the so-called “photorefractive”
materials [Gunter, 1988; Gunter, 1989], which includes single crystals of semi-insulating
optical materials such as bismuth silicon oxide (Bi₁₂SiO₂₀), bismuth germanium oxide
(Bi₁₂GeO₂₀), lithium niobate (LiNbO₃), strontium barium niobate (Sr₁₋ₓBaₓNb₂O₆),
potassium niobate (KNbO₃) and barium titanate (BaTiO₃), as well as semi-insulating
semiconductors such as gallium arsenide (GaAs), indium phosphide (InP) and cadmium
telluride (CdTe). The use of the term “photorefractive” to describe these materials
exclusively is somewhat misleading, in that many other classes of materials are known
to undergo a refractive index change following illumination as well as those traditionally
included in the class described above. But at least the term is descriptive of the basic
phenomenon involved, as outlined below.

In  photorefractive  materials  such  as  bismuth  silicon  oxide,  exposure  to  an  interference 
pattern at an appropriate wavelength (characterized by significant photosensitivity) generates
free charge carriers (electrons or holes) liberated from deep traps. The number of
photogenerated  carriers  is  in  general  proportional  to  the  local  intensity  absorbed  by  the 
crystal;  as  such,  the  photogenerated  carrier  population  mimics  the  exposure  pattern  in 
both  amplitude  and  phase.  The  photogenerated  carriers  are  free  to  diffuse  to  regions  of 
lower  intensity,  or  they  can  be  assisted  out  of  the  brightest  regions  by  application  of  a  bias 
electric  field  to  produce  carrier  drift.  In  either  case,  they  tend  to  be  retrapped,  in  turn 
creating a space charge distribution that has the same spatial frequency as the interference
pattern. This space-variant space charge distribution produces a locally modulated
electric  field  with  the  same  spatial  frequency  (as  determined  by  the  grating  spacing  or 
grating  wavelength),  which  in  turn  induces  a  local  change  in  the  refractive  index  of  the 
photorefractive material through the linear (Pockels) or quadratic (Kerr) electrooptic effect
[Kaminow, 1974]. The refractive index grating can then be probed by a readout beam
to  generate  a  diffracted  beam,  just  as  in  the  case  of  the  pure  phase  gratings  described 
previously. 


An excellent set of review articles on the physical properties and applications of photorefractive
materials has been assembled by Gunter and Huignard [Gunter, 1988; Gunter,
1989]. The state of the art is such that 1 cm³ crystals of many of these materials have
been grown, and shown to exhibit a very high degree of optical quality. Exposure sensitivities
vary widely, but several crystals require of order 500 µJ/cm² for full exposure to
saturation (the highest grating strength that can be achieved in that particular crystal).
This corresponds to the absorption of about 50 mW/cm² of optical intensity throughout
1 cm³ of material for an exposure period of 10 msec. The spatial frequencies that
can be supported in these materials range from a few lines/mm to over 2000 lines/mm.
Diffraction efficiencies close to 100% have been observed in several types of crystals, while
others saturate nearer to 10% for thicknesses of order 1 cm.
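
The exposure figures quoted above can be checked with a few lines of arithmetic. The following Python sketch is illustrative only; the function names are hypothetical and do not come from the original text. It multiplies a constant intensity by an exposure time to obtain the exposure energy density, and converts a spatial frequency in lines/mm into a grating period.

```python
def exposure_fluence_uj_per_cm2(intensity_mw_per_cm2: float, exposure_ms: float) -> float:
    """Energy per unit area (in uJ/cm^2) delivered by a constant intensity."""
    # 1 mW * 1 ms = 1e-3 W * 1e-3 s = 1e-6 J = 1 uJ, so no extra scale factor is needed.
    return intensity_mw_per_cm2 * exposure_ms


def grating_period_um(lines_per_mm: float) -> float:
    """Grating period (in micrometers) for a given spatial frequency."""
    # 1 mm = 1000 um, so the period is 1000 um divided by the number of lines per mm.
    return 1000.0 / lines_per_mm


if __name__ == "__main__":
    # Values quoted in the text: 50 mW/cm^2 for 10 msec.
    print(exposure_fluence_uj_per_cm2(50.0, 10.0))  # 500.0
    # Upper end of the quoted spatial frequency range: 2000 lines/mm.
    print(grating_period_um(2000.0))  # 0.5
```

At 2000 lines/mm the grating period is 0.5 µm, comparable to the optical wavelength itself.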

Optimization of photorefractive materials for interconnection device applications is under
way, including the development of growth processes for large photorefractive crystal
boules with a high degree of optical uniformity; the characterization of both unintentionally
incorporated impurities and intentionally incorporated dopants that alter the holographic
recording, readout, and storage characteristics; the use of applied d.c. and a.c. bias electric
fields to enhance the holographic recording sensitivity; the use of polarization effects
to enhance the reconstructed image signal-to-noise ratio; and the antireflection coating of
the (typically high index) front and rear surfaces to increase the diffraction efficiency and
avoid the presence of unwanted gratings due to multiple reflections [Karim, 1988; Karim,
1989a; Karim, 1989b]. In addition, the origin of electric field nonuniformities that occur
within photorefractive crystals during grating recording is under active investigation, and
several  methods  of  eliminating  the  field  collapse  have  been  discovered  [Herbulock,  1988]. 
Use  of  these  methods  increases  both  the  saturation  diffraction  efficiency  and  the  grating 
response  time,  resulting  in  more  efficient  interconnection  devices  that  operate  at  higher 
recording  sensitivities. 


Sources  and  Source  Arrays 


In  reviewing  a  large  fraction  of  the  journal  articles  published  over  the  past  decade 
on  optical  information  processing  and  computing,  including  the  most  recent  coverage  of 
photonic  implementations  of  neural  networks,  you  will  be  inspired  perhaps  by  the  cleverness 
of  a  particular  proposed  architecture,  or  intrigued  by  the  novel  features  of  a  particular 
device structure. But you will also be amazed at the apparent lack of emphasis on certainly
one of the most fundamental components in any proposed photonic computational system:
the source of the light! This oversight may be caused in part by direct analogy to the
situation in VLSI electronics, in which it is a bit unglamorous (and probably also to a
certain extent unnecessary) to concentrate on the battery or the power supply. After
all, electrical power is relatively inexpensive, widely available, well characterized, and
reasonably abundant. At peak usage, your home probably uses about 10 kW, most of
which  is  dissipated  in  the  air  conditioner. 

However,  the  situation  in  photonic  technology  is  quite  different.  Sources  of  coherent 
optical  radiation  that  can  produce  average  output  powers  in  the  10  kW  range  exist  in 
only a few laboratories, are very large (about 15 m³), usually emit in the far infrared
(10.6 µm), and are far from inexpensive. Incoherent sources in the range of 100 - 1000 W
are available (xenon-mercury (Xe-Hg) gas discharge lamps, for example), but this type of
source is typically noisy (exhibits large intensity fluctuations), difficult to collimate, and
characterized by a very short lifetime (from the systems perspective). In addition, gas
discharge lamps are broadband sources, and as such usually require wavelength filtering
in order to provide compatibility with wavelength sensitive devices such as volume holographic
optical elements and spatial light modulators. A broadband source that has been
suitably filtered to allow readout of a typical volume holographic optical element (within
the allowable spectral bandwidth of the stored diffraction gratings) might generate only
about 10⁻⁵ - 10⁻⁶ of its total rated power in the wavelength region of interest. For the
1000 W Xe-Hg lamp, this results in only about 1 - 10 mW of quasi-monochromatic optical
power.
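
The filtered lamp estimate is a single multiplication, sketched below in Python. The function name is hypothetical, and the usable fractions of one part in 10^5 to one part in 10^6 are taken to be the values implied by the quoted 1 - 10 mW result for a 1000 W lamp.

```python
def filtered_power_mw(rated_power_w: float, usable_fraction: float) -> float:
    """Quasi-monochromatic power (in mW) left after spectral filtering."""
    # Convert watts to milliwatts after applying the usable spectral fraction.
    return rated_power_w * usable_fraction * 1e3


if __name__ == "__main__":
    for fraction in (1e-5, 1e-6):
        power = filtered_power_mw(1000.0, fraction)
        print(f"fraction {fraction:.0e}: about {power:.1f} mW")
```

Even a kilowatt-class lamp therefore delivers no more usable narrowband power than a small helium neon laser.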


Coherent,  monochromatic  optical  power  can  be  provided  by  an  array  of  different  types 
of laser sources [Milonni, 1988], including the argon-ion (Ar⁺) laser, the neodymium-YAG
(Nd-YAG)  laser,  the  helium  neon  (He-Ne)  laser,  the  helium  cadmium  (He-Cd)  laser,  dye 
lasers,  excimer  lasers,  and  semiconductor  laser  diodes.  Typical  monochromatic  (single  laser 
line)  power  outputs  from  the  first  two  types  range  from  about  500  mW  to  25  W.  Helium 
neon  and  helium  cadmium  lasers  are  readily  available  as  well  as  relatively  inexpensive,  but 
have  output  powers  that  are  typically  in  the  range  1-5  mW,  peaking  out  at  about  50  mW. 
Dye  lasers  are  often  optically  pumped  by  argon-ion  lasers,  and  hence  exhibit  power  outputs 
slightly  lower  than  that  of  the  pump  laser.  Excimer  lasers  are  typically  operated  in  the 
pulsed  mode  of  operation  at  repetition  rates  of  10  -  1000  pulses  per  second,  and  emit 
average  powers  in  the  10  -  100  W  range.  Finally,  semiconductor  laser  diodes  are  available 
with very long lifetimes at output powers of 1 - 20 mW, and much shorter lifetimes in the
100 mW - 1 W range.

Of  these  six  different  types  of  coherent  sources,  the  first  five  are  still  relatively  bulky 
(about 0.1 m³), consume considerable electrical power, generate significant amounts of
heat (many must be water cooled to ensure stable operation and practical lifetimes), and
are very expensive (especially when compared with a comparable electronic power supply!).
Although these sources can be (and indeed are) employed in current systems-level
demonstrations, their collective liabilities do not augur well for their eventual incorporation
in commercially viable computational systems in general, and perhaps neural network
applications in particular. This leaves the last category, that of semiconductor diode lasers
(including, possibly, miniaturized diode-pumped Nd-YAG lasers), for further consideration.

Before discussing the properties of semiconductor diode lasers as optical power sources
any further, we should at least note that the range of output powers available from these
sources (1 - 100 mW for single element devices) is rather limited. Taking an upper bound
(with continued research and development) of about a watt per device gives us a realistic
estimate of the amount of average coherent source power available for at least circuit level
implementation of photonic neural networks, though certainly at the systems level phased
arrays  of  stripe  laser  diodes  and/or  multiple  sources  could  conceivably  be  employed. 

Semiconductor diode lasers [Kressel, 1977; Casey, 1978a; Casey, 1978b] have been extensively
investigated and developed over the past two decades for a broad range of commercial
applications, including compact disk player recording and readout, fiber optical
communications systems [Jones, 1988], merchandise optical scanners, and laser printers.
The physical size of these lasers is small enough (about 0.3 x 1 x 5 mm) to fit in a standard
transistor (or IC) package, as long as external cooling is not required. Lasers with power
outputs of 1 - 10 mW are relatively inexpensive, costing a few tens of dollars in quantity
on the average. Higher output power lasers are considerably more expensive, however, as
are lasers with very narrow spectral linewidths (so-called single longitudinal mode lasers).
For the higher power lasers (as well as for the intermediate power lasers that are required
to maintain a high degree of center wavelength accuracy), external cooling (e.g. by means
of a thermoelectric cooler) must be provided in order to maintain thermal stability in both
wavelength and output power.

The  wavelength  ranges  spanned  by  semiconductor  diode  lasers  are  dictated  by  the 
direct bandgap materials used to fabricate the coherent light-emitting diode (semiconductor
p-n junction). Aluminum gallium arsenide/gallium arsenide (AlGaAs/GaAs) lasers
grown on single crystal gallium arsenide substrates emit at wavelengths in the range 780
to 900 nm, while lasers based on the quaternary indium gallium arsenide phosphide (InGaAsP)
compound semiconductor system (and grown on indium phosphide substrates)
emit at wavelengths further into the infrared (1.2 to 1.6 µm). The aluminum gallium
arsenide/gallium arsenide lasers in particular are nearly wavelength matched to the peak
sensitivity  of  both  silicon  and  gallium  arsenide  photodetectors,  as  might  be  employed  for 
photonic  switching  in  spatial  light  modulator  arrays,  or  for  detection  of  computed  results 
in  a  system  diagnostic  or  output  plane. 

Within  these  ranges,  a  typical  multimode  semiconductor  diode  laser  has  a  spectral 
bandwidth  of  0.5  -  2  nm;  a  single  longitudinal  mode  laser  has  a  much  narrower  spectral 
bandwidth of order 10⁻⁴ nm (about 50 MHz centered at an optical frequency of 3.5 x
10¹⁴ Hz). Both multimode and single longitudinal mode diode lasers can be used to
write and read holographic optical interconnection elements, as long as the coherence
length of the laser is larger than the thickness of the holographic recording medium. The
coherence length of a laser is essentially the maximum path difference over which two
beams derived from the same laser can maintain the stable phase relationship necessary to
exhibit an interference pattern. In applications requiring high multiplexing capacity within
the holographic interconnection medium (or significant path differences among beams that
must coherently interfere), the narrower linewidths of the single longitudinal mode lasers are
often  preferable  since  their  coherence  lengths  are  several  orders  of  magnitude  longer.  For 
example,  typical  multimode  semiconductor  diode  lasers  operated  above  threshold  exhibit 
coherence  lengths  in  the  range  0.1-10  mm,  while  stabilized  single  longitudinal  mode  diode 
lasers  can  have  coherence  lengths  exceeding  1  m. 
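
The relationship between spectral linewidth and coherence length can be illustrated numerically. The Python sketch below uses the order-of-magnitude estimate L = c / (frequency linewidth), or equivalently L = (wavelength squared) / (wavelength linewidth). The exact prefactor depends on the lineshape and on the definition of coherence length adopted, so the results should be read as rough estimates only. The function names and the 850 nm center wavelength chosen for the multimode case are assumptions made for illustration.

```python
C_M_PER_S = 299_792_458.0  # speed of light in vacuum


def linewidth_nm_from_hz(delta_nu_hz: float, center_freq_hz: float) -> float:
    """Convert a frequency linewidth into a wavelength linewidth (nm)."""
    wavelength_m = C_M_PER_S / center_freq_hz
    # delta_lambda = lambda^2 * delta_nu / c, valid for narrow lines.
    return wavelength_m ** 2 * delta_nu_hz / C_M_PER_S * 1e9


def coherence_length_m_from_hz(delta_nu_hz: float) -> float:
    """Order-of-magnitude coherence length (m) from a frequency linewidth."""
    return C_M_PER_S / delta_nu_hz


def coherence_length_m_from_nm(delta_lambda_nm: float, center_wavelength_nm: float) -> float:
    """Order-of-magnitude coherence length (m) from a wavelength linewidth."""
    return (center_wavelength_nm * 1e-9) ** 2 / (delta_lambda_nm * 1e-9)


if __name__ == "__main__":
    # Single longitudinal mode: 50 MHz at 3.5e14 Hz, as quoted in the text.
    print(f"{linewidth_nm_from_hz(50e6, 3.5e14):.2e} nm")
    print(f"{coherence_length_m_from_hz(50e6):.1f} m")
    # Multimode: 0.5 - 2 nm bandwidth at an assumed 850 nm center wavelength.
    for bandwidth_nm in (0.5, 2.0):
        length_mm = coherence_length_m_from_nm(bandwidth_nm, 850.0) * 1e3
        print(f"{bandwidth_nm} nm -> {length_mm:.2f} mm")
```

Under these assumptions the single mode case gives a linewidth near 10⁻⁴ nm and a coherence length of several meters, while the multimode case falls in the sub-millimeter to millimeter range, consistent with the figures quoted above.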

Employing a single, high intensity optical power source in a typical neural network application
carries with it a potential penalty: an inherent tradeoff between energy efficiency
on the one hand, and the need for array generation optics on the other. This tradeoff
arises from the fact that most optoelectronic implementations of neuron unit arrays have
either photodetectors or modulation windows (in some cases both) that are smaller in size
than each individual pixel, as was shown schematically in Figures 15.4 and 15.7. The ratio
of the area of a given photosensitive element to the entire pixel area is referred to as the
fill factor of the pixel (with respect to that particular element). Typical fill factors for
the photodetectors and modulation windows may range from less than 0.1 in the case of
monolithic integration to about 0.5 for hybrid integrated devices. Light that falls outside
the appropriate areas within a given pixel will at best contribute to the overall system
throughput loss, and at worst may adversely affect the function of adjacent devices that
exhibit photosensitivity.
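
The fill factor definition and its consequence for throughput can be made concrete with a short Python sketch. The pixel and detector dimensions used here are hypothetical, chosen only to illustrate the calculation, and the loss estimate assumes perfectly uniform illumination of a square pixel.

```python
def fill_factor(element_width_um: float, element_height_um: float, pixel_pitch_um: float) -> float:
    """Ratio of photosensitive element area to total (square) pixel area."""
    return (element_width_um * element_height_um) / (pixel_pitch_um ** 2)


def uniform_illumination_loss(ff: float) -> float:
    """Fraction of uniformly distributed light that misses the element."""
    return 1.0 - ff


if __name__ == "__main__":
    # Hypothetical example: a 10 um x 10 um detector in a 50 um pitch pixel.
    ff = fill_factor(10.0, 10.0, 50.0)
    print(f"fill factor = {ff:.2f}")
    print(f"fraction of light lost = {uniform_illumination_loss(ff):.2f}")
```

For a fill factor below 0.1, more than 90% of uniformly distributed light misses the intended element, which is the motivation for the array generation optics discussed next.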

In order to efficiently channel the optical illumination to the correct photosensitive
regions, we need to (a) expand the source illumination uniformly to fill the entire aperture
of the device in question (a spatial light modulator or volume holographic optical element,
for example), (b) in many cases collimate (or re-collimate) the light source to produce a
planar wavefront with a beam of constant width, (c) spatially filter the beam to enhance
its uniformity by eliminating significant fixed-pattern noise, (d) focus the light within each
individual pixel to a size compatible with the relevant photosensitive area (in effect thereby
generating  a  two-dimensional  array  of  focused  beamlets),  and  (e)  align  the  resulting  array 
of  focused  beamlets  with  each  succeeding  device  in  the  optical  path. 


The procedures and optical elements required for beam expansion, collimation, and
spatial filtering are well understood among the optical community for the case in which the
source beam is initially axially symmetric, as is typical of gas and excimer laser systems.
In typical semiconductor laser diodes, however, the planar nature of the light-emitting
heterojunction region often gives rise to a diffraction-induced beam divergence parallel
to the junction of 3 - 10 degrees, and a corresponding beam divergence perpendicular to
the junction of 20 - 60 degrees. Comparable procedures and optical elements for such
anamorphic  (non-axially  symmetric)  beams  are  more  complex,  and  are  currently  under 
development.  Also  under  development  are  a  number  of  types  of  semiconductor  diode 
lasers  that  emit  approximately  axially  symmetric  beams  suitable  for  standard  collimation 
and  filtering  systems. 

The optical source array generation problem has received considerable attention recently,
due primarily to significant interest in optical interconnection systems. In one
promising approach, a two-dimensional array of computer generated and photographically
reduced amplitude-encoded Fresnel zone plates has been used to form an 8 x 8 grid of
microlenses that function by means of diffraction (from what is, practically speaking, a
computer generated hologram (CGH)) rather than refraction [Marrakchi, 1990]. In another
well-developed approach, computer generated binary phase holograms (so-called Dammann
gratings [Dammann, 1971]) have been configured to form large grid patterns of regularly
spaced illuminated spots with predetermined locations and fill factors [Morrison, 1989].
Using this latter technique, 32 x 32 arrays have been generated with both high throughput
efficiencies and low scattered light by crossing two fabricated 1 x 32 grating arrays. In
addition, an 81 x 81 array has been experimentally demonstrated by using two pairs of
crossed 1 x 9 grating arrays in an optical arrangement that generates multiple images by
means of a convolution operation [McCormick, 1989]. In both of these techniques, all of the
resulting light beamlets are mutually coherent, as they derive from the same source. This
mutual coherence has an impact on the utilization of such source arrays for the generation
of independent holographic interconnection networks, as described in the subsection on
“Photonic Interconnections” above.
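
The spot counts quoted for the crossed grating arrangements follow from simple combinatorics, sketched below in Python. Spots are modeled as idealized integer grid positions. The function names are hypothetical, and the 9:1 pitch ratio between the two crossed pairs is an assumption chosen so that the convolved spots do not overlap; the text does not specify the actual optical parameters.

```python
from itertools import product


def crossed_grating_spots(n: int, pitch: int = 1) -> set:
    """Spot positions produced by crossing two 1 x n gratings at right angles."""
    orders = [k * pitch for k in range(n)]
    # Every x order combines with every y order: an n x n grid.
    return set(product(orders, orders))


def convolve_spot_arrays(a: set, b: set) -> set:
    """Convolution of two idealized spot arrays: each spot of a is replicated at each spot of b."""
    return {(ax + bx, ay + by) for (ax, ay) in a for (bx, by) in b}


if __name__ == "__main__":
    print(len(crossed_grating_spots(32)))  # 1024 spots, i.e. 32 x 32

    fine = crossed_grating_spots(9, pitch=1)    # 9 x 9 array at unit pitch
    coarse = crossed_grating_spots(9, pitch=9)  # 9 x 9 array at nine times the pitch (assumed)
    combined = convolve_spot_arrays(fine, coarse)
    print(len(combined))  # 6561 spots, i.e. 81 x 81
```

With the assumed pitch ratio, the convolved array is a regular 81 x 81 grid with no coincident spots.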

An interesting alternative to the generation of pixelated optical sources by modification
of the properties of a single source is that of direct fabrication of multiple source arrays.
One striking example is the recent successful fabrication of over one million independent
surface-emitting semiconductor diode lasers on a single gallium arsenide chip [Jewell, 1990].
Both cylindrical and square cross-section microlasers have been fabricated with diameters
and edge dimensions in the range 1 - 5 µm, with heights above the surface of the wafer of
about 5.5 µm as shown schematically in Figure 15.18. In the fabrication process employed,
the laser mirrors are arranged to generate laser emission through the 500 µm thick gallium
arsenide substrate, as shown in the figure. In order to accomplish this without significant
absorption in the substrate, the active (lasing) medium is composed of InGaAs quantum
wells with GaAs barriers, giving rise to an emitted infrared wavelength (≈ 950 nm) that
lies in a region of substrate transparency.


FIGURE  15.18  Illustration  of  a  surface-emitting  laser  diode  source  array 
[after  Jewell,  1990].  In  this  example,  the  individual  semiconductor  laser  diodes 
are isolated by chemically assisted ion beam etching techniques, must be individually
contacted, and emit through the GaAs substrate.


In  the  present  configuration,  the  lasers  are  essentially  optically  isolated,  and  hence  are 
not  designed  to  be  mutually  coherent  (phase  locked).  In  fact,  over  time  constants  typical 
of  holographic  recording  in  currently  available  photorefractive  crystals  (milliseconds),  it  is 
likely that such arrays are for all practical purposes mutually incoherent, due both to the
optical isolation and to process-induced variations in device parameters that alter the
wavelength emitted from each individual laser. Arrays of surface-emitting semiconductor
lasers  that  have  been  specifically  designed  to  have  uniformly  separated  wavelengths  have 
also  been  demonstrated  [Chang-Hasnain,  1990].  We  shall  return  to  this  characteristic 
in  a  succeeding  section  that  addresses  a  particular  strategy  for  photonic  neural  network 
implementation. 

At present, each laser within the array operates at a threshold voltage of about 10
volts at a threshold current of a few milliamperes, resulting in a power dissipation of 10
- 50 milliwatts per device at threshold, and higher for power outputs significantly above
threshold. In order to keep the overall power density within established limits (1 - 10
W/cm²) and thus to keep from overheating the substrate (resulting in potentially deleterious
effects on wavelength stability and/or catastrophic failure), the lasers must either be
spaced  appropriately,  operated  in  a  pulsed  (on/off)  mode  at  less  than  unity  duty  cycle,  or 
temporally  multiplexed  (turning  on  only  a  few  lasers  at  a  time)  by  resorting  to  individual 
rather than parallel addressing. Given the current rate of progress in the development of
these and other types of surface-emitting laser arrays, it is reasonable to expect demonstration
of continuous operation of up to 10⁴ microlasers per square centimeter within the
near future.
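
The thermal constraint described above can be turned into a rough budget. The Python sketch below estimates how many lasers per square centimeter can run continuously within a given power density limit, and what duty cycle a denser array would need. The function names are hypothetical, and the array density used in the duty cycle example is illustrative rather than a measured value.

```python
def max_continuous_density(limit_w_per_cm2: float, dissipation_mw: float) -> float:
    """Largest number of continuously operated lasers per cm^2 within the power density limit."""
    return limit_w_per_cm2 / (dissipation_mw * 1e-3)


def required_duty_cycle(density_per_cm2: float, dissipation_mw: float, limit_w_per_cm2: float) -> float:
    """Duty cycle (0 to 1) that keeps the average power density at or below the limit."""
    full_power_w_per_cm2 = density_per_cm2 * dissipation_mw * 1e-3
    return min(1.0, limit_w_per_cm2 / full_power_w_per_cm2)


if __name__ == "__main__":
    # Pessimistic case: 1 W/cm^2 limit, 50 mW per device.
    print(f"{max_continuous_density(1.0, 50.0):.0f} lasers per cm^2")
    # Optimistic case: 10 W/cm^2 limit, 10 mW per device.
    print(f"{max_continuous_density(10.0, 10.0):.0f} lasers per cm^2")
    # Illustrative dense array: 1e4 lasers per cm^2 at 10 mW each, 10 W/cm^2 limit.
    print(f"duty cycle = {required_duty_cycle(1e4, 10.0, 10.0):.2f}")
```

With the dissipation figures quoted in the text, continuous operation is limited to roughly tens to a thousand devices per square centimeter, so denser arrays require lower dissipation per device, pulsed operation, or temporal multiplexing.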

It  should  be  noted  that  the  array  shown  in  Figure  15.18  is  not  currently  configured 
for  parallel  operation  of  all  of  the  sources  simultaneously,  which  would  require  electrical 
contact  to  the  tops  of  each  selectively  etched  microcavity.  This  feature  could  likely  be 
provided  by  an  additional  surface  passivation  and  metallization  step.  Matrix-addressable 
surface-emitting laser arrays have recently been fabricated by forming columns of lasers
separated by etched isolation grooves, and interconnected across the grooves by striped row
contacts [Orenstein, 1990a]. Application of an appropriate bias voltage across a given pair
of electrodes (column and row) activates the laser diode at the intersection, allowing for
raster-scanned operational modes as well as fully parallel operation [Von Lehmen, 1990].

Other currently investigated approaches to surface-emitting laser array fabrication use
various techniques to form the microlaser cavities within the planar substrate without the
need for deep etched isolation grooves, such as by the use of ion implantation to form
electrically insulating isolation layers between the laser cavities [Tai, 1989a; Orenstein,
1990b] or by the current confinement that results from photolithographic definition of
one of the two laser mirrors and its associated electrical contact [Tai, 1989b]. Fabrication
processes that yield planar or quasi-planar device structures allow for direct parallel contact
if desired without the complications of depositing contacts on vertical sidewalls.

Before leaving the subject of semiconductor laser diodes and surface-emitting laser arrays,
it is worthwhile to note a very useful feature of such devices: their capacity for high
bandwidth direct modulation. By this we mean that the output intensity of the semiconductor
laser source can be modulated (at full modulation depth, i.e. from well below the
threshold for lasing to peak output power) at frequencies up to a few gigahertz by direct
variation of the voltage applied across the device. This attribute can be used to advantage
in many neuro-optical implementation architectures by eliminating the need for mechanical
or electrooptical shutters, as well as by offering temporal multiplexing as an additional
degree of freedom for the systems designer.

One additional type of solid state device that is capable of both single source and source
array fabrication is the light emitting diode (LED). Closely related to the semiconductor
laser diode, the LED is also a p-n junction device that can be fabricated with considerably
less processing complexity by elimination of the high reflectivity mirrors that form the
semiconductor laser cavity. An additional advantage is the lack of a threshold for operation,
allowing the LED to emit over a much wider dynamic range of applied voltages.
One drawback of light emitting diodes is that they are relatively broadband (incoherent)
sources, and as such are not usable as sources for holographic recording applications (and
in many cases for readout of multiplexed holographic optical elements as well). In addition,
they are relatively inefficient emitters with typical electrical-to-optical conversion
efficiencies of a few percent. This feature tends to make LEDs rather power consumptive
for  a  given  amount  of  usable  output  intensity. 


Detectors  and  Detector  Arrays 

Detectors  are  optoelectronic  components  that  act  as  photon-to-electron  converters,  in 
that  they  transform  incident  optical  intensity  into  electronic  form,  usually  a  voltage  or  a 
current.  Detectors  therefore  allow  the  optical  representation  of  neuron  unit  outputs,  for 
example,  to  be  converted  into  an  electronic  representation  for  further  processing.  As  such, 
they  are  important  components  for  the  photonic  implementation  of  neural  networks  in  at 
least  two  functional  areas:  (a)  as  input  transducers  for  the  necessary  optical  detection 
function of optically addressed spatial light modulators, and (b) as output transducers
for the translation of optically generated intermediate and final results to an appropriate
electronic  format.  After  all,  once  you’ve  gone  to  all  of  the  trouble  of  learning  and  computing 
with  a  neural  network,  it  might  prove  worthwhile  occasionally  to  actually  get  the  answer 
out  and  use  it  to  initiate  some  other  useful  process! 

In  both  of  these  functional  areas,  we  can  further  categorize  detectors  as  (a)  single  pixel 
detectors, and (b) interconnected detector arrays. In the first category, we include both
single  element  detectors  that  have  one  optical  input  aperture  and  one  output  channel,  as 
well  as  the  single  pixel  detectors  employed  as  part  of  an  array  in  two-dimensional  spatial 
light  modulators.  This  latter  assignment  is  made  because  even  though  detectors  used  in 
spatial  light  modulators  are  perhaps  configured  in  an  array,  their  outputs  are  used  only 
within  one  or  at  most  a  few  local  pixels.  In  the  second  category,  we  include  arrays  of 
detector  elements  that  are  interconnected  in  such  a  way  that  the  entire  parallel  (one- 
or two-dimensional) array can be read out electronically through one or more output
channels. An example of a detector array in this category might be the light sensitive
element in the CCD (charge-coupled-device) camera, now commonplace in many solid
state cameras and video cassette recorders.

This  distinction  between  single  pixel  detectors  and  detector  arrays  is  important  because 
the  technologies  that  are  commonly  employed  in  these  two  cases  differ  in  a  number  of 
respects, and as a result can exhibit wide differences in performance characteristics such as
bandwidth,  sensitivity,  linearity,  and  dynamic  range.  In  the  case  of  single  pixel  detectors, 
for  example,  it  proves  easier  to  jointly  optimize  performance  parameters  because  of  the 
larger  number  of  degrees  of  freedom  available  to  the  device  designer  in  a  single  input,  single 
output  system.  The  detector  array  designer,  on  the  other  hand,  often  must  make  additional 
tradeoffs  dictated  by  the  nature  of  the  charge  storage  and  readout  process  employed  over 
the  full  set  of  integrated  pixels. 

In  the  context  of  photonic  neural  network  implementations,  single  pixel  detectors  have 
two  primary  functions.  The  first  is  to  act  as  optical  signal  to  electronic  signal  converters 
within  optically  addressed  spatial  light  modulators,  to  translate  a  pixel’s  worth  of  incident 
light  intensity  (representing,  for  example,  the  weighted  sum  of  signals  from  the  output  of 
a plane of neuron units) into a voltage or current. The resultant electronic signal can then
be processed by local intrapixel circuitry to produce the desired neural threshold function
for subsequent optical encoding (modulation). This process could be accomplished
either onboard a monolithic or hybrid integrated optically addressed spatial light modulator
(OASLM), or on a separate detector chip that interfaces with an electrically addressed
spatial light modulator (EASLM). In this latter case, the detector will most likely fall
under the detector array category discussed further below, since a parallel-to-serial conversion
is typically required to extract the array of data (e.g., an image) from the detector
chip (followed by a serial-to-parallel conversion to load the signal into the EASLM). It
should be noted that even in the case of monolithic spatial light modulators that do not
feature discrete detectors, electronic control circuitry, and modulators, converting a
two-dimensional optical input distribution into a modified two-dimensional output distribution
necessarily involves a local detection function, even if it is not particularly easy to separate
the detection process from the modulation process.

The second important single pixel detector function is to provide for single point monitoring
functions within the system, such as the output power from a given laser source,
the average power emitted from a laser source array, or a particular system output that
activates a desired process or function (for example, the identification of a specific defect
pattern on a manufactured part within the input image field of a neural image processor,
that in turn results in rejection of the part).

Perhaps  the  simplest  type  of  detection  element  that  can  be  incorporated  in  a  single 
pixel  is  the  photoconductor,  which  typically  consists  of  a  thin  film  of  material  that  alters 
its  resistance  to  electrical  current  in  response  to  the  intensity  of  incident  illumination. 
The most commonly used single pixel photodetectors, however, are based in some way or
other  on  the  semiconductor  p-n  junction  diode.  Under  reverse  bias  in  a  p-n  junction  diode, 
photocarriers  created  by  light  absorbed  within  the  region  of  the  junction  between  n-type 
and  p-type  semiconductor  layers  are  swept  out  of  the  junction  region  by  the  internal  electric 
field  across  the  junction,  and  collected  in  the  external  circuit.  If  the  internal  electric  field 
is  high  enough,  each  photocarrier  can  acquire  enough  energy  during  sweepout  to  generate 
an  avalanche  of  additional  carriers,  leading  to  significant  gain  in  the  class  of  so-called 
avalanche  photodiodes. 
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The  inclusion  of  an  intrinsic  (undoped  or  compensated)  layer  of  semiconductor  mate¬ 
rial  between  the  n-type  and  p-type  layers  allows  for  a  significant  reduction  in  the  junction 
capacitance  of  the  device,  with  a  corresponding  improvement  in  signal  bandwidth.  Such 
devices  are  commonly  referred  to  as  p-i-n  photodiodes,  packaged  versions  of  which  are 
commercially  available  for  a  wide  variety  of  photosensor  functions.  Typical  p-i-n  photo¬ 
diodes  exhibit  risetimes  of  a  few  nanoseconds,  eire  linear  in  output  over  seven  orders  of 
magnitude  of  input  intensity,  and  are  sensitive  to  very  low  light  intensity  levels.  For  silicon 
p-i-n  photodiodes,  sensitivities  of  about  0.4  milliamperes  of  output  current  per  milliwatt  of 
optical  input  power  at  a  wavelength  of  830  nm  are  common,  which  represents  a  conversion 
efficiency  from  photons  to  electrons  of  approximately  60%. 
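
As a quick check of the figures quoted above, the following sketch converts a responsivity (amperes per watt) into a photon-to-electron conversion efficiency. The function name and structure are illustrative only; the physical constants are the standard SI values.

```python
# Physical constants (SI units).
PLANCK = 6.62607015e-34            # J*s
LIGHT_SPEED = 2.99792458e8         # m/s
ELECTRON_CHARGE = 1.602176634e-19  # C


def quantum_efficiency(responsivity_a_per_w, wavelength_m):
    """Electrons collected per incident photon for a given responsivity."""
    photon_energy = PLANCK * LIGHT_SPEED / wavelength_m  # joules per photon
    # (A/W) * (J/photon) / (C/electron) = electrons per photon
    return responsivity_a_per_w * photon_energy / ELECTRON_CHARGE


# 0.4 mA/mW is the same as 0.4 A/W; 830 nm is the wavelength quoted in the text.
eta = quantum_efficiency(0.4, 830e-9)
print(f"conversion efficiency = {eta:.2f}")  # about 0.60
```

The result is consistent with the approximately 60% stated in the text.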

Phototransistors  are  light  sensitive  devices  that  exhibit  current  gain  in  exactly  the  same 
manner as a transistor, with the exception that the controlling base current is injected optically
rather than through the base lead. In fact, most VLSI transistors (both bipolar and
MOS)  are  photosensitive  (though  perhaps  not  optimized  for  the  photodetector  role),  and 
must be protected from stray light in order not to compromise their performance characteristics.
The principal advantage of a phototransistor is its inherent current gain of order 100
to  1000,  which  often  makes  the  interface  of  the  photodetector  to  following  circuitry  more 
straightforward.  In  cases  requiring  exceptionally  high  gain  in  the  front  (photodetection) 
end, two transistors can be paired as shown in Figure 15.19 so that one acts as a phototransistor,
and the other as a current amplifier. Such a two transistor combination has achieved
widespread  use,  and  is  referred  to  as  a  photo-Darlington  pair  [Sze,  1981b].  The  tradeoffs 
for  increased  gain  in  both  of  these  cases  are  risetime  (which  translates  directly  into  signal 
bandwidth)  and  area  required  for  integration.  Typical  risetimes  for  phototransistors  are 
almost  three  orders  of  magnitude  higher  (a  few  microseconds)  than  those  characteristic 
of p-i-n photodiodes. Photo-Darlingtons are yet another factor of ten or so slower in response
time. Optimized phototransistors and photo-Darlingtons require relatively large
collector-base  junctions  in  order  to  provide  an  appropriately  sized  photosensitive  region 
that  can  be  accessed  by  optical  imaging  techniques. 


FIGURE 15.19 Schematic diagram of a photo-Darlington pair utilized as a
high gain detector/amplifier combination.


In  many  if  not  most  cases,  the  type  of  photodetector  chosen  for  use  as  a  single  pixel 
detector in a spatial light modulator application depends on its integrability with associated
control electronics and modulation elements. This, in turn, depends on whether the
particular  spatial  light  modulator  in  question  is  monolithically  or  hybrid  integrated,  as 
discussed in the section on photonic switching above, and on which semiconductor substrate
the photodetection element itself is to be fabricated. In some cases, the desire for
integration  of  a  high  density  of  neuron  units  may  place  strict  bounds  on  the  area  allocated 
to  each  separate  function  in  general,  and  on  the  photodetection  and  requisite  amplification 
function  in  particular. 

In  traditional  applications  of  photodetector  technology,  for  example  in  spectroscopy 
and  optical  metrology,  linearity  of  response  (output  voltage  or  current  as  a  function  of  the 
input  intensity)  is  prized,  as  is  a  wide  dynamic  range  over  which  linearity  is  assured.  In 
neural  network  applications,  however,  linearity  is  typically  less  of  an  issue.  In  fact,  it  is 
often  convenient  to  use  the  inherent  nonlinearity  of  the  input-output  characteristic  of  a 
particular  photodetector  device  to  generate  part  or  all  of  the  nonlinearity  required  of  the 
overall  neural  unit  function.  This  can  result  in  a  lower  overall  expenditure  of  real  estate 
for  each  neuron  unit,  increasing  the  neuron  array  density,  as  well  as  in  a  reduction  of 
circuit  complexity  within  each  pixel.  One  such  example  is  the  output  current  saturation 
characteristic  of  phototransistors  at  high  input  intensities,  which  can  be  used  to  emulate 
the  upper  saturation  regime  of  the  sigmoidal  neuron  response  function. 

Detector  arrays  are  employed  whenever  the  intensity  distribution  of  a  one-  or  two- 
dimensional  image  field  requires  conversion  to  electronic  form  for  interface  with  succeeding 
computational or output stages of the system. In a very real sense, a two-dimensional
detector  array  is  nothing  more  than  the  business  end  of  an  optoelectronic  camera  that 
can  be  positioned  anywhere  within  the  optical  system  that  the  local  intensity  distribution 
represents a desired result. In fact, low reflectivity beamsplitters can be used to merely
“sample” the local intensity distribution of a given beam of light, allowing most of the
incident  light  to  propagate  in  a  further  computational  arm  of  the  optical  train  for  use 
elsewhere. 

Detector arrays are inherently different in at least one key respect from the single pixel
photodetectors (as well as arrays of photodetectors used in optically addressed spatial light
modulators)  discussed  previously:  the  need  to  provide  for  some  form  of  output  channel 
multiplexing,  in  order  to  avoid  the  requirement  for  a  one-to-one  correspondence  between 
pixels  in  the  array  and  output  pins.  For  example,  in  a  1000  x  1000  element  detector 
array, fully parallel readout requires one million output channels or pinouts. As a result,
detector  arrays  are  usually  configured  to  perform  some  form  of  parallel-to-serial  conversion 
of  the  data  into  a  single  high  bandwidth  serial  channel  prior  to  the  readout  of  each  frame 
(though  multiple  output  channels  can  also  be  used).  This  can  either  add  significant  circuit 
complexity  to  the  area  surrounding  each  pixel  in  order  to  accommodate  for  the  parallel-to- 
serial  conversion  and  interpixel  communication  function,  or  be  directly  incorporated  into 
the  design  of  the  photodetection  elements  themselves,  as  in  the  case  of  the  CCD  arrays 
discussed  below. 

The  state  of  the  art  of  detector  arrays  has  advanced  tremendously  even  over  the  past 
decade,  to  the  point  where  solid  state  detector  arrays  with  quite  spectacular  performance 
are used everywhere from astronomical applications (as detectors for even the largest telescopes),
to earth observation satellites (infrared focal plane arrays), to photomicroscopy (in
place of the traditional film-based photographic camera), to consumer products (electronic
still  photography  and  video  cameras). 

One of the most successful and generally available types of solid state detector array
is the charge-coupled-device (or CCD) array [Sze, 1981c; Optical Engineering, 1987a;
Optical Engineering, 1987b]. In this technology, usually based on MOS fabrication techniques
in silicon (but adaptable to compound semiconductor substrates as well), incident
illumination  within  a  given  pixel  causes  the  accumulation  of  photogenerated  charge  in  an 
electrostatic  potential  well  formed  by  the  application  of  bias  voltages  on  a  set  of  electrodes, 
as  shown  schematically  in  Figure  15.20.  In  operation,  the  CCD  array  is  illuminated  for 
a given exposure time (slightly less than one full frame interval), during which time the
charge generated by the incident illumination is integrated within each primary well. Subsequently,
appropriate voltages are applied by means of multi-phased electrode structures
to  spill  the  accumulated  charge  packet  into  the  neighboring  well,  while  simultaneously 
moving  the  charges  in  the  neighboring  well  to  its  neighboring  well,  and  so  on  throughout 
the  array. 


FIGURE 15.20 Schematic diagram of a charge coupled device (CCD) photodetector
array fabricated on a silicon substrate. Electrostatic potential wells
are created by application of appropriate voltages to the three phase bias electrode
structure, with electrical isolation provided by the gate oxide layer. Light
incident through the transparent electrodes creates stored charge that can be
transferred to an output signal terminal by proper sequential phasing of the
bias voltages (P1 - P3).


The overall operation resembles the function of an array of one-dimensional shift registers.
At one edge of the structure, the charge packets from each row are collected into a
single column that is read out by a very high speed shift register (a linear, usually buried
channel CCD array) to form the single output channel. Full readout of the array must
occur  before  the  next  frame  is  exposed  (except  in  specifically  designed  cases  such  as  the 
time-delay-and-integrate  or  TDI  mode  of  operation,  in  which  only  one  shift  is  interposed 
between  successive  exposures). 
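
The shift register analogy can be made concrete with a small simulation. This is a minimal sketch under simplifying assumptions (ideal charge transfer, one well per pixel, readout toward the right-hand edge); the function name `read_out_ccd` is illustrative and is not taken from the cited references.

```python
def read_out_ccd(frame):
    """Serialize a 2-D frame of charge packets in CCD fashion.

    frame[r][c] is the charge integrated in row r, column c. Each parallel
    shift moves every row one well toward the right-hand edge, where the
    packets from all rows land in a single output column. That column is
    then drained packet by packet into the single output channel.
    """
    rows = [list(row) for row in frame]  # copy, since readout is destructive
    n_cols = len(rows[0]) if rows else 0
    stream = []
    for _ in range(n_cols):
        # Parallel shift: the edge well of every row spills into the column.
        output_column = [row.pop() for row in rows]
        for row in rows:
            row.insert(0, 0)  # an empty well enters at the far edge
        # Serial shift: the output column drains to the output channel.
        stream.extend(output_column)
    return stream


frame = [[1, 2, 3],
         [4, 5, 6]]
print(read_out_ccd(frame))  # [3, 6, 2, 5, 1, 4]
```

Every packet leaves through one channel, which is why a full frame must be read out before the next exposure in the ordinary mode of operation.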

One-dimensional  arrays  of  CCD  elements  have  been  successfully  fabricated  in  sizes 
of 1 x 2048, while special purpose two-dimensional CCD imaging arrays 2048 x 2048 in
size are commercially available [Blouke, 1987]. This represents a parallel detector with
4,194,304 individual pixels! In one particular 2048 x 2048 CCD array, the imaging area is
5.5 x 5.5 centimeters, with a pixel size of 27 x 27 microns. This array exhibited a dark
(unilluminated)  noise  buildup  in  each  pixel  of  only  6  to  12  electrons  when  read  out  at  a 
rate of 50,000 pixels per second, which allows for detection of extremely low level signals
with excellent signal-to-noise ratio. Given a well capacity of about 700,000 electrons, this
very low noise figure suggests a dynamic range in excess of 70,000, nearly five orders of
magnitude! For well charge densities less than 200,000 electrons per pixel, the linearity is better
than 0.5% over this portion of the full dynamic range. Finally, this array exhibited an
extraordinarily  high  charge  transfer  efficiency  of  0.999992,  representing  the  fraction  of 
charge  within  a  given  pixel  that  is  routinely  transferred  to  an  adjacent  pixel  without  loss. 
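
The figures quoted for this array can be cross-checked with a few lines of arithmetic. The sketch below uses only numbers stated in the text, plus one labeled assumption: that the worst-case charge packet undergoes one transfer per pixel across the full array and then along the full output register (2 x 2048 transfers). A three phase device may count transfers differently, so the retained fraction is indicative only.

```python
n_side = 2048
pixels = n_side * n_side                    # total pixel count
imaging_side_cm = n_side * 27e-6 * 100      # 27 micron pitch, in centimeters

well_capacity = 700_000                     # electrons
noise_low, noise_high = 6, 12               # electrons per pixel
dynamic_range_min = well_capacity / noise_high
dynamic_range_max = well_capacity / noise_low

readout_seconds = pixels / 50_000           # at 50,000 pixels per second

cte = 0.999992                              # charge transfer efficiency
worst_case_transfers = 2 * n_side           # ASSUMPTION: one transfer per pixel shift
retained_fraction = cte ** worst_case_transfers

print(pixels)                               # 4194304
print(f"imaging side: {imaging_side_cm:.2f} cm")
print(f"dynamic range: {dynamic_range_min:.0f} to {dynamic_range_max:.0f}")
print(f"full frame readout: {readout_seconds:.1f} s")
print(f"charge retained after {worst_case_transfers} transfers: {retained_fraction:.4f}")
```

Under these assumptions the slow readout rate that gives the low noise figure also implies a full frame readout time on the order of a minute or more, and even the most distant pixel retains most of its charge.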

The  integration  of  large  scale  detector  arrays  by  means  of  VLSI  techniques  provides  the 
prospect  of  special  purpose  arrays  that  perform  part  of  the  computational  function  within 
the  confines  of  the  array.  One  example  of  such  special  purpose  chips  is  the  incorporation 
in  a  CCD  array  of  charge-coupled  analog  circuitry  to  perform  arithmetic  operations  such 
as addition, subtraction, and magnitude comparison [Fossum, 1987]. Such an array could
allow for detection of parallel differential outputs, with both positive (excitatory) and negative
(inhibitory) weighted sums as dual optical inputs in a (positive definite) intensity
representation. 


ARCHITECTURAL  CONSIDERATIONS  FOR 
PHOTONIC  NEURAL  NETWORK  IMPLEMENTATIONS 


We  now  turn  our  attention  to  the  use  of  the  photonic  components  and  fundamental 
principles described above in the implementation of highly parallel neural network architectures.
The focus in this section is on a general framework that emphasizes characteristics
common to different approaches to photonic and optical neural network implementations,
as well as on illuminating some of the key fundamental differences among the various
implementation approaches. A review of recent and ongoing research in photonic and optical
neural network implementations is beyond the scope of this chapter; sources of such
information  can  be  found  in  the  Suggested  Further  Reading  section  at  the  end  of  this 
chapter. 

Photonic neural network implementations can be adaptive or non-adaptive, can represent
the signal using different physical quantities, and can be built using one-dimensional
(1-D) or two-dimensional (2-D) arrays of neuron units with two-dimensional or three-
dimensional (3-D) interconnection elements. These issues, in addition to other features
that are desirable in any photonic implementation of a neural network, are discussed in
this  section.  Throughout,  one  should  keep  in  mind  the  distinctions  that  exist  among 
systems  with  fixed  interconnections,  programmable  systems,  and  truly  adaptive  systems. 
We  will  initially  concentrate  on  the  implementation  of  a  single  layer  of  a  network,  and 
subsequently  show  how  this  generalizes  to  multiple  layers. 

The  computation  process  of  any  one  layer  of  a  neural  network  can  be  represented  by: 


    y_i = f\left[ \sum_j w_{ij} x_j \right]        (23)

in  which  neuron  unit  j  is  situated  at  the  input  to  the  layer  of  interconnections,  neuron  unit 
i is situated at the output of the layer of interconnections, y_i is the output of neuron unit
i, x_j is the output of neuron unit j, w_ij is the weight associated with the interconnection
between them, and the function f represents the neuron unit nonlinearity. The term inside
the brackets, the activation potential, will be denoted by p_i. Note that the term in brackets
is a matrix-vector product between an interconnection weight matrix and an input vector.
The function f then operates independently on each element of the resulting vector; this is
called  a  point  nonlinearity,  and  as  such  lends  itself  to  implementation  with  a  spatial  light 
modulator  (SLM). 
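
The single layer computation of Equation (23), read here as y_i = f(sum over j of w_ij x_j), can be sketched in a few lines. The sigmoid is used only as a representative point nonlinearity; the function names are illustrative.

```python
import math


def layer_output(weights, x, f):
    """Compute y_i = f(p_i), with p_i = sum_j weights[i][j] * x[j]."""
    # Matrix-vector product: one activation potential per output neuron unit.
    potentials = [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in weights]
    # Point nonlinearity: f acts on each potential independently.
    return [f(p_i) for p_i in potentials]


def sigmoid(p):
    return 1.0 / (1.0 + math.exp(-p))


weights = [[1.0, -1.0],   # w_ij for output unit i = 0
           [0.5, 0.5]]    # w_ij for output unit i = 1
y = layer_output(weights, [1.0, 1.0], sigmoid)
print(y)  # first element is exactly 0.5, since p_0 = 0
```

The two steps map onto separate hardware functions: the weighted sum onto the optical interconnection, and the elementwise f onto the neuron unit array.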

Most  current  learning  algorithms  fall  into  one  of  a  small  number  of  classes.  For  example, 
one  such  class  can  be  specified  by: 


    \Delta w_{ij} = \alpha \delta_i x_j - \beta w_{ij}        (24)

in which Δw_ij = w_ij(k+1) - w_ij(k) is the weight update, k represents the iteration index,
α is the learning gain constant, and β is a decay constant that is included primarily for
hardware convenience; β can be set to 0 when so desired. Suitable choices of δ_i give
different learning algorithms, such as Hebbian, Widrow-Hoff, single-layer least mean
squares (LMS), and for the case of multilayer networks, backward error propagation. (For
example, an optical architecture that potentially implements backward error propagation
in a multilayer neural network is described by Wagner and Psaltis [Wagner, 1987].) In this
chapter  we  will  restrict  our  attention  to  this  particular  class  of  algorithms  for  illustrative 
purposes.  Although  other  classes  of  learning  algorithms  can  likely  also  be  implemented 
using  photonic  hardware,  research  to  date  has  focused  primarily  on  the  class  represented 
by  Equation  (24).  An  important  aspect  of  Equation  (24)  for  implementation  is  the  outer 
product between the training vector δ and the input vector x for the weight matrix update.
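
A minimal sketch of the update of Equation (24) follows, assuming the form: weight change = (gain) x (training term i) x (input j) - (decay) x (current weight). The parameter names `alpha` and `beta` stand for the learning gain and decay constants. The example choice of training term, target minus output for a linear unit, is the standard textbook Widrow-Hoff form and is an assumption here, not something specified in the text.

```python
def update_weights(weights, delta, x, alpha, beta=0.0):
    """Return new weights w_ij + alpha * delta_i * x_j - beta * w_ij."""
    # The alpha * delta_i * x_j term is the outer product of delta and x.
    return [[w_ij + alpha * d_i * x_j - beta * w_ij
             for w_ij, x_j in zip(row, x)]
            for row, d_i in zip(weights, delta)]


# One Widrow-Hoff style step for a single linear output unit (assumed form).
weights = [[0.0, 0.0]]
x = [1.0, 2.0]
target = [1.0]
output = [sum(w * x_j for w, x_j in zip(row, x)) for row in weights]
delta = [t - y for t, y in zip(target, output)]   # error signal as training term
weights = update_weights(weights, delta, x, alpha=0.5)
print(weights)  # [[0.5, 1.0]]
```

Because every w_ij receives its own product of a training term and an input, all updates are independent and can in principle be applied simultaneously.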

An  example  of  a  photonic  neural  system  is  shown  in  block  diagram  form  in  Figure 
15.21. This system utilizes a 1-D array of neuron units at the input and output, and
a 2-D interconnection mask. Each pixel in the input is expanded optically (using cylindrical
lenses) and illuminates the corresponding row of the interconnection mask. The
mask  stores  the  analog  weights,  and  provides  a  pointwise  multiplication  before  the  beam 
is  contracted  so  that  one  column  from  the  mask  is  incident  onto  one  corresponding  output 
pixel.  The  optical  system  in  effect  provides  a  fully  parallel  analog  optical  matrix-vector 
multiplication  as  represented  by  the  bracketed  term  in  Equation  (23),  performed  over  all  i. 
Threshold  functions  and  feedback  connections  are  provided  by  means  of  either  photonics 
or  electronics.  The  first  experimental  demonstration  of  such  a  system  applied  to  neural 
network  implementations  used  an  array  of  light  emitting  diodes  (LEDs)  as  inputs  to,  and  a 
linear detector array as the output from, the optical interconnection [Psaltis, 1985; Farhat,
1985].  This  particular  system  utilized  electronics  to  provide  the  threshold  functions  and 
feedback  connections. 
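
The expand, mask, and contract sequence described above can be mimicked numerically. In this sketch the mask is indexed as `mask[j][i]` (row j lit by input pixel j, column i collected by output pixel i), which is one reading of the geometry in the text; the function name is illustrative.

```python
def optical_matrix_vector(mask, x):
    """Emulate the 1-D to 1-D optical matrix-vector multiplier.

    mask[j][i] is the transmittance met by light travelling from input
    pixel j toward output pixel i, so it plays the role of weight w_ij.
    """
    n_in, n_out = len(mask), len(mask[0])
    # Expansion: input pixel j is spread uniformly along row j of the mask.
    illumination = [[x[j]] * n_out for j in range(n_in)]
    # Mask: pointwise multiplication of the light by the stored weights.
    transmitted = [[illumination[j][i] * mask[j][i] for i in range(n_out)]
                   for j in range(n_in)]
    # Contraction: column i of the mask is summed onto output pixel i.
    return [sum(transmitted[j][i] for j in range(n_in)) for i in range(n_out)]


mask = [[0.5, 1.0],
        [0.25, 0.0],
        [1.0, 0.5]]
print(optical_matrix_vector(mask, [2.0, 4.0, 1.0]))  # [3.0, 2.5]
```

All of the multiplications happen in one pass of light through the mask, so the number of pixels in the mask sets the number of interconnections.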


FIGURE 15.21 Block diagram of a 1-D to 1-D photonic neural network,
in which a one-dimensional neuron array is fully interconnected to a one-dimensional
detector array by means of a two-dimensional interconnection
mask.


It should be noted that many variants of Figure 15.21 are possible; some of them are
more compact than others, though all of them share essentially the same basic characteristics.
The interconnection mask can be fixed (e.g., photographic film) or variable (e.g., an
SLM). In the latter case the SLM can be electronically or optically addressed. Electronic
addressing is appropriate for straightforward interfacing to an electronic machine that
supplies  the  (updated)  interconnection  weights,  whereas  for  a  maximum  adaptation  rate 
an  optical  addressing  technique  would  ultimately  be  optimal.  Currently  available  SLMs 
with  large  numbers  of  pixels  tend  to  be  slow  (500  x  500  analog  pixels  with  1  -  100  ms 
frame  times)  [Tanguay,  1985];  much  faster  technologies  are  being  developed  for  future  use 
[see,  for  example,  Lentine,  1988;  Lentine,  1991;  McCormick,  1989b].  Such  a  system,  with 
1-D  inputs,  1-D  outputs,  and  2-D  interconnections,  will  likely  scale  up  to  100  -  1000  fully 
connected  neuron  units. 

A photonic system that can implement larger numbers of neuron units and interconnections
is shown in Figure 15.22. All neuron unit planes are now 2-D arrays, and the
interconnection medium is a 3-D structure, implemented in a volume holographic material.
In effect, there is a separate volume grating connecting each input neuron unit j to
each output neuron unit i. The diffraction efficiency of each grating is proportional to the
weight, w_ij, of the corresponding interconnection. Note that each such grating is analogous
to  a  beamsplitter,  as  discussed  in  the  previous  section,  with  the  primary  difference  that  the 
volume  gratings  are  direction  (and  wavelength)  selective.  Thus,  beams  incident  on  such 
a “beamsplitter” at other than the correct angle are not affected by the presence of the
holographic beamsplitter. Properly recorded, then, the grating w_ij is situated in angular
orientation and grating period so that it affects only the inputs at the angle corresponding
to x_j, and will direct the corresponding output w_ij x_j to the correct summation node p_i.
The  achievable  numbers  of  neuron  units  and  interconnections  are  currently  subjects  of 
considerable debate, but would likely be 10^5 - 10^6 neuron units per plane and on the order
of 10^10 independent interconnections with weights, assuming continued research unveils no
impassable  boundaries. 


FIGURE 15.22 Block diagram of a 2-D to 2-D photonic neural network,
in which a two-dimensional neuron array is fully interconnected to a two-
dimensional output array by means of a three-dimensional volume holographic
optical interconnection mask. The input plane, output plane, and optional
training plane are shown. Many variants of this geometry with similar properties
are possible.


For the case of an adaptive network, we use a variable (typically photorefractive) holographic
material for recording and implementing the interconnections. To incorporate
learning, a training plane comprising a 2-D array of nodes generates the δ_i terms (Figure
15.22). During a weight update, an exposure is made of the interference pattern between
beams emanating from the two left hand planes in the figure. Each of the two left hand
planes could be implemented using, for example, a 2-D spatial light modulator illuminated
by an expanded beam. This results in a change in the refractive index modulation representing
the current weight that is dependent on the product δ_i x_j, so that with appropriate
choices of parameters, the increment in diffraction efficiency can be made proportional
to δ_i x_j. Ideally, this records changes (updates) in the interconnection weights within the
hologram given by Equation (24), above, in the form of gratings situated with appropriate
angular orientation and grating period. It should be noted that generating and
recording these weight updates is not a simple matter, and care must be taken to ensure
that the appropriate interference terms are recorded and that not too much crosstalk is
inadvertently created. Recording and recall of the correct values is primarily a number
representation issue and is discussed below; undesirable crosstalk depends on the recording
and reconstruction technique as previously discussed in the subsection on “Photonic
Interconnections”.

An  example  of  one  source  of  holographically-induced  interconnection  crosstalk  is  an 
inadvertent  degeneracy  of  gratings.  Even  though  each  volume  grating  affects  only  the 
beams  incident  at  a  particular  angle  with  respect  to  the  grating,  it  affects  all  of  the  beams 
at that particular angle. Because of this, an entire cone of beams (with its axis of symmetry
aligned with the grating wave vector) can be affected by a single diffraction grating. This
degeneracy creates an undesired coupling between different interconnections in a fully
connected  network.  For  neuron  unit  sources  on  an  ideal,  rectangular  grid,  this  coupling  can 
be  eliminated  by  removing  neuron  units  from  certain  locations  in  the  array,  leaving  sparsely 
distributed  neuron  units  arranged  in  a  degeneracy  breaking  pattern.  This  eliminates  the 
undesired coupling, at the expense of a reduction in the number of neuron units from N^2
(for an N x N array) to N^(3/2) [Psaltis, 1989].

The case of a non-adaptive network is likely to be an important one as well. In this case
the interconnection hologram does not have to be recorded in accordance with a specific
learning algorithm. If the weights are known a priori, then any applicable recording
technique will suffice. In many cases, however, the weights may not be known. A common
scenario may involve the training of a “master” network; once it has been trained, copies
of the network could be produced in a production environment. If the network is large,
and particularly if it utilizes volume holographic optical interconnections, then probing the
values of all of the weights could be impractical. The most efficient production means in
this  case  would  be  to  make  direct  copies  of  the  volume  hologram.  Thus,  the  capability  of 
rapidly  copying  a  multiplexed  volume  interconnection  hologram  is  important. 

The  physical  representation  of  the  signal  directly  impacts  the  operation  of  a  photonic 
neural  network.  The  physical  quantities  available  for  optical  representation  of  a  signal 
level  are  field  amplitude,  phase,  intensity,  polarization,  spatial  position  or  frequency,  and 
wavelength. We will consider only the most likely candidates: field amplitude (with phase)
and intensity. For the case of an amplitude (with phase) representation, the signals may in
general be complex valued; bipolar signals, of course, represent a subset of these numbers,
and thus can be represented. Given that x and y are represented as (electric or magnetic)
field amplitudes, the resulting detected activation potential of neuron unit i, p_i, is given by

    p_i^{(coh)} = \left| \sum_j w_{ij} x_j \right|^2        (25)

for the case of a coherent sum, and by

    p_i^{(incoh)} = \sum_j \left| w_{ij} x_j \right|^2        (26)

for the case of an incoherent sum (cf. the preceding section on “Fundamental Principles
of Photonic Technology”). In both Equations (25) and (26), the weight w_ij is represented
physically  by  the  amplitude  diffraction  efficiency.  The  coherent  sum  given  by  Equation 
(25) has the advantage of allowing for the addition of both positive and negative numbers
in  computation  of  the  neuron  unit  potential,  as  desired  for  the  incorporation  of  both 
excitatory  and  inhibitory  neuron  unit  inputs.  Clearly,  Equations  (25)  and  (26)  deviate 
from  conventional  neural  network  models.  The  effects  on  different  neural  network  models 
of  such  deviations  in  the  summation  before  thresholding  are  not  currently  well  understood. 
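
The difference between the two summation modes can be illustrated numerically. This sketch assumes that a detector responds to intensity, so that the coherent potential is the squared magnitude of the summed amplitudes and the incoherent potential is the sum of the squared magnitudes of the individual terms. Both function names are illustrative.

```python
def coherent_potential(weights_row, x):
    """Amplitudes add first, then the detector squares the total field."""
    total_field = sum(w * a for w, a in zip(weights_row, x))
    return abs(total_field) ** 2


def incoherent_potential(weights_row, x):
    """Each term is detected separately, so intensities add."""
    return sum(abs(w * a) ** 2 for w, a in zip(weights_row, x))


weights_row = [1.0, 1.0]
x = [1.0, -1.0]          # one excitatory and one inhibitory amplitude input
print(coherent_potential(weights_row, x))    # 0.0, the two fields cancel
print(incoherent_potential(weights_row, x))  # 2.0, the sign information is lost
```

The cancellation in the coherent case is what permits inhibitory inputs, while the squaring in both cases is the departure from the conventional linear weighted sum.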

If we instead encode the signal levels as intensities, the activation potential becomes

    p_i = \sum_j w_{ij} x_j        (27)

which is the desired activation potential, but at the expense of all terms in the summation
being nonnegative. In this case the weight w_ij is represented physically by the intensity
diffraction efficiency. A technique for effectively achieving bipolar signals in this case will
be discussed in the section describing “An Implementation Strategy”.

The signal representation used also impacts the nature of the weight updates. The
physical weight updates can be derived using common models of photorefractive (or other)
recording materials. Such a derivation requires a number of approximations and assumptions
to be made regarding the chosen operational mode. By appropriate choice of the
operational mode, the ideal weight update rule given by Equation (24) can be approximately
obtained for both intensity representation and amplitude representation cases. The
operational mode may not prove to be the same in each case, and may differ in such parameters
as the size of the weight updates, the size of the existing weights before the update,
and the exact characteristics of the holographic material used. The “second order” terms
that deviate from the precise form of Equation (24) are also different in the two cases;
the effect of such terms on learning algorithm performance is not well characterized or
understood, and is currently an active area of research.

So far we have discussed only a single interconnection layer with neuron units for inputs
and outputs. If such a physical network includes feedback, it can be generalized to
functionally  implement  an  arbitrary  multilayer  feedforward  or  recurrent  network.  Figure 
15.23  illustrates  this  principle,  showing  one  physical  layer  of  neuron  units,  one  layer  of 
interconnections  from  the  neuron  units  to  a  set  of  fan-in  nodes,  and  feedback  from  each 
fan-in  node  to  the  corresponding  neuron  unit.  These  neuron  units  can  be  conceptually 
divided into groups corresponding to different functional layers. Some of the physical interconnections
then represent functionally feedforward connections (represented by solid
lines  and  boxes  in  Figure  15.23),  and  some  represent  functionally  lateral  connections  within 
a  layer  (represented  by  broken  lines  and  boxes  in  Figure  15.23).  Feedback  connections  to 
previous  layers,  and  feedforward  connections  that  bypass  the  next  subsequent  layer,  can 
also  be  incorporated  in  a  similar  manner,  but  are  not  shown  in  the  figure.  This  technique 
for  implementing  multilayer  networks  using  a  single  physical  layer  has  been  discussed  by 
Farhat for the case of 1-D neuron unit arrays interconnected by a 2-D mask, and used
in  the  implementation  of  parallel  optoelectronic  simulated  annealing  [Farhat,  1987].  Thus 
any  photonic  (single  physical  layer)  architectures  discussed  herein  generalize  to  multilayer 
networks,  provided  that  they  have  capability  for  arbitrary  connections  and  feedback. 
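
The way a single physical layer with feedback emulates a multilayer network can be sketched with a block-structured weight matrix. In this illustration five physical units are partitioned into an input group (units 0 and 1), a hidden group (units 2 and 3), and an output group (unit 4); only the blocks joining one group to the next are nonzero. Holding the input units fixed during feedback, the hard threshold, and the exclusive-OR weights are all assumptions made for the example.

```python
def feedback_pass(weights, state, f, clamped):
    """One trip around the feedback loop of the single physical layer."""
    new_state = []
    for i, row in enumerate(weights):
        if i in clamped:
            new_state.append(state[i])  # input units are held at their values
        else:
            new_state.append(f(sum(w * s for w, s in zip(row, state))))
    return new_state


def threshold(p):
    return 1.0 if p > 0 else 0.0


# Rows are destination units, columns are source units.
weights = [
    [0, 0, 0, 0, 0],    # unit 0: input (clamped)
    [0, 0, 0, 0, 0],    # unit 1: input (clamped)
    [1, -1, 0, 0, 0],   # unit 2: hidden, fed by the input group
    [-1, 1, 0, 0, 0],   # unit 3: hidden, fed by the input group
    [0, 0, 1, 1, 0],    # unit 4: output, fed by the hidden group
]


def run(x0, x1):
    state = [x0, x1, 0.0, 0.0, 0.0]
    for _ in range(2):  # two passes: input to hidden, then hidden to output
        state = feedback_pass(weights, state, threshold, clamped={0, 1})
    return state[4]


print([run(a, b) for a in (0.0, 1.0) for b in (0.0, 1.0)])  # [0.0, 1.0, 1.0, 0.0]
```

Lateral or bypass connections would simply occupy other blocks of the same matrix, at the cost of one feedback pass per functional layer traversed.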


FIGURE 15.23 A single layer physical neural network with feedback, used
to implement a multilayer recurrent functional network. The solid boxes indicate
feedforward connections, and the broken boxes indicate lateral connections.


In summary, the desirable characteristics of a photonic implementation of neural networks
include: (1) modularity, so that multiple “modules” can be cascaded; (2) capability
for lateral, feedforward, and feedback interconnections, which can be achieved physically
by use of a single layer network with feedback and arbitrary interconnection capability;
(3) analog, weighted connections with analog signals; (4) bipolar signals and weights; (5)
scalability to large numbers of neuron units with high connectivity; (6) generality, so that
different neuron models, network models, and learning algorithms can be implemented
within the same basic technology; (7) compatibility of different components within a given
architecture; and (8) overall feasibility of the proposed combination of algorithm, architecture,
devices, and materials. In addition, the optical/photonic hardware would ideally
incorporate  the  following  features:  (1)  simultaneous,  parallel  updates  of  all  interconnection 
weights  at  each  iteration;  (2)  high  optical  throughput;  (3)  low  interconnection  crosstalk; 
and  (4)  flexible  functionality  for  neuron  unit  response,  so  that  different  neuron  models  and 
learning  algorithms  can  be  accommodated. 


AN  IMPLEMENTATION  STRATEGY 

In this section a photonic technique for the implementation of neural networks is described
that potentially satisfies the aforementioned desirable characteristics and features
[Jenkins, 1990a; Asthana, 1990a; Jenkins, 1990b; Asthana, 1990b; Jenkins, 1990c]. This
photonic neural network implementation technique utilizes optoelectronic spatial light
modulators (SLMs) for the 2-D neuron unit and training term planes. Each neuron unit
incorporates dual channel encoding to allow for the representation of bipolar input
and output signals, and comprises two integrated detectors, two modulators, and integrated
electronics. The neuron unit input and output signals are represented in the optical
system by intensity. The interconnections are based on a 3-D holographic material with a
novel incoherent/coherent recording and reconstruction technique that permits simultaneous
updates of all weights during each iteration. In addition, the interconnections utilize
a unique double angular multiplexing arrangement to minimize interchannel crosstalk and
throughput losses, in which each pixel of the object beam SLM is illuminated by a set of
mutually incoherent beams, each at a different angle. This implementation technique is
explained in the remainder of this section.

A key feature of this implementation strategy is the use of an array of individually
coherent sources that are mutually incoherent to generate an array of coherent beam pairs
used for holographic recording and reconstruction in the interconnection network.
Consider the problem of recording two holograms, object A recorded with reference beam x_1 and
object B recorded with reference beam x_2, as shown in Figure 15.24(a). The objects A
and B could each be a 2-D array of data. In order to write both holograms simultaneously,
A and x_1 originate from the same coherent source and are mutually coherent; similarly
for B and x_2. However, B and x_2 originate from a different source than A and x_1, so
that each pair is incoherent with respect to the other pair. In this way, there are no extra
(crosstalk) holograms written, such as that between A and x_2, or between x_1 and x_2. This
technique can be used for more than two multiplexed holograms, in which case a separate
source is assumed for each hologram written.

FIGURE 15.24 Incoherent/coherent technique for recording and reconstructing
multiple holograms simultaneously, in which all solid lines represent mutually
coherent beams, and all broken lines represent a separate set of mutually
coherent beams: (a) recording; (b) reconstruction; and (c) holographic
representation, in which each hologram represents the fanout from a given neuron
unit.
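
The absence of crosstalk holograms can be illustrated with a minimal numerical sketch. It assumes an idealized model in which the recorded grating strength is proportional to the time-averaged product of one field with the complex conjugate of the other, and in which the relative phase between two different sources sweeps uniformly over a full cycle during the exposure. The beam amplitudes and the function name grating_term are hypothetical.

```python
import cmath
import math

def grating_term(amp_a, amp_b, relative_phases):
    """Magnitude of the time-averaged interference term a * conj(b).

    relative_phases samples the phase difference between the two beams
    over the exposure time (an idealized model, assumed for illustration).
    """
    total = sum(amp_a * (amp_b * cmath.exp(1j * phi)).conjugate()
                for phi in relative_phases)
    return abs(total) / len(relative_phases)

# Beams from the same source: the relative phase stays fixed.
coherent = [0.0] * 360
# Beams from different sources: the relative phase wanders over all values,
# idealized here as a uniform sweep over one full cycle.
incoherent = [2 * math.pi * k / 360 for k in range(360)]

A, x1, B, x2 = 1.0, 0.8, 0.6, 0.9   # hypothetical beam amplitudes

print(grating_term(A, x1, coherent))     # desired hologram (A, x1): nonzero
print(grating_term(B, x2, coherent))     # desired hologram (B, x2): nonzero
print(grating_term(A, x2, incoherent))   # crosstalk (A, x2): averages to ~0
print(grating_term(x1, x2, incoherent))  # crosstalk (x1, x2): averages to ~0
```

In this model only the two beam pairs that share a source leave a net grating; every pairing across sources averages out.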

During reconstruction, the holograms are illuminated by the same set of reference
beams x_1 and x_2. This simultaneously reconstructs the arrays A and B (Figure 15.24(b)).
If the arrays are in registry upon reconstruction, a pixel-by-pixel incoherent sum will be
achieved in the output array. If we now consider each reference beam x_j to be the output of
a neuron unit at the input to an interconnection layer, then each reconstructed hologram
corresponds to the fan-out from one neuron unit, with a contribution to each pixel in
the output array proportional to the weight of the corresponding interconnection. This
is depicted in Figure 15.24(b) and (c), in which the two signals fanning in to a given
neuron unit are derived from separate, mutually incoherent optical sources. Note that this
technique provides an incoherent sum for the potential of each neuron unit (Equation (26)
or Equation (27), depending on the chosen representation), as desired.
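
As a rough illustration of the incoherent fan-in, the sketch below treats each stored hologram as a row of nonnegative diffraction efficiencies and adds the reconstructed intensities pixel by pixel. It assumes the potential takes the usual weighted-sum form p_i = sum_j w_ij x_j; the function name and the numerical values are hypothetical.

```python
def reconstruct(holograms, x):
    """Incoherent pixel-by-pixel sum of the reconstructed arrays.

    holograms[j][i] is the (nonnegative) weight with which hologram j sends
    light to output pixel i; x[j] is the intensity of reference beam j.
    Intensities simply add because the beams are mutually incoherent.
    """
    n_out = len(holograms[0])
    p = [0.0] * n_out
    for j, x_j in enumerate(x):
        for i in range(n_out):
            p[i] += holograms[j][i] * x_j
    return p

# Two neuron units fanning out to two output pixels (illustrative values).
holograms = [[0.2, 0.5],
             [0.4, 0.1]]
x = [1.0, 0.5]
p = reconstruct(holograms, x)
print(p)
```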

Another critical as well as unique feature of the photonic architecture described herein
is a “double angular multiplexing” technique in which one input node or pixel in the object
beam path has multiple beams passing through it at different angles. Thus, a set of
angularly multiplexed beams is introduced for each object beam node δ_i, as shown in Figure
15.25. A three-fold angularly multiplexed fan-in from x_1, x_2, and x_3 to yield neuron unit
potential p_i is depicted in this figure; solid lines represent mutually coherent beams (all
dashed lines represent a mutually coherent set as well; similarly for mixed dashed lines).
Note that this multiplexing technique eliminates the fan-in beam degeneracy characteristic
of collinear geometries referred to above in the subsection “Photonic Interconnections”.
Thus, the ensuing cross-coupling terms are absent, and a much more accurate set of weights
can be recorded and reconstructed at each iteration.


FIGURE  15.25  Doubly  angularly  multiplexed  volume  holographic  optical 
interconnection,  designed  to  circumvent  the  effects  of  beam  degeneracy.  The 
mutually  incoherent  input  beams  ({xj})  are  angularly  multiplexed  over  as 
are  the  corresponding  sets  of  output  beams  from  the  training  plane  ({^P^}) 
generated by the coherent sources S_j, to produce an angularly multiplexed
fan-in at each summed output, thus yielding the neuron activation potentials {p_i}.


A photonic architecture for neural network implementation that utilizes these principles
is shown in Figure 15.26, for the case of Hebbian learning (δ_i = y_i). The components
shown in the figure comprise one module; inputs and outputs refer to this particular
module. Only feedforward connections are shown. The upper spatial light modulator, SLM1,
generates the training terms δ_i that also represent neuron unit outputs in this case. The
lower spatial light modulator, SLM2, is the array of input neuron units. An array of
individually coherent but mutually incoherent sources is used to illuminate the system; they
are provided by a mutually incoherent laser diode array or by a coherent beam passing through
an SLM that temporally modulates the phase of each pixel independently. (It can be shown that
the latter method is equivalent to the former for the particular type of holographic
recording and reconstruction used herein.) A volume holographic material stores the requisite
weighted interconnections, and can implement either fixed or adaptive interconnections
depending on the material used.


FIGURE 15.26 Photonic architecture for neural network implementation
that incorporates a parallel source array, double angular multiplexing, and
incoherent/coherent recording and reconstruction; the Hebbian case is depicted.


Both spatial light modulators in Figure 15.26 consist of an array of pixels, each of
which comprises three elements: (1) two integrated detectors for input of positive and
negative parts of the neuron unit activation potential, (2) integrated electronic circuitry
to provide the neuron unit (sigmoid or hard threshold) nonlinearity, and (3) two hybrid
or monolithically integrated modulators for separate optical readout of the positive and
negative neuron unit outputs. The SLMs, as shown, are read out in transmission, and
have detectors situated so as to receive optical inputs on the right face of the SLM.

In the learning phase, the shutter is open as shown schematically in Figure 15.27. Light
from each source S_j is approximately collimated so that it illuminates the entire array on
SLM1, at an angle dependent on the position of the j-th source. Thus, for an N by N array
of sources, there are N^2 beams reading out the contents of SLM1 simultaneously, each at
a different angle; the entire array of terms {y_i} is encoded onto each of these beams. Each
such beam then interferes only with its corresponding reference beam x_j, derived from the
same source and encoded by SLM2, in the holographic medium. This writes the set of
desired weight update terms α x_j y_i.

FIGURE 15.27 Photonic architecture for neural network implementation:
recording configuration. This configuration implements the learning function
in the photonic architecture of Figure 15.26. The sets of beams emitted from
the source array (two are shown) interfere in the volume holographic medium
to update the weights stored in the interconnection holograms.
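
The parallel weight update written in the learning phase can be sketched numerically as an outer product. The sketch assumes an update of the form alpha * x_j * y_i added to every weight at once, with alpha a hypothetical gain constant that stands in for exposure and material sensitivity; it ignores the erasure and saturation behavior of a real holographic medium.

```python
def hebbian_update(weights, x, y, alpha):
    """Return weights + alpha * outer(y, x).

    weights[i][j] connects input neuron unit j to output neuron unit i.
    All entries are updated in the same step, mirroring the simultaneous
    optical exposure of all beam pairs.
    """
    return [[w_ij + alpha * y_i * x_j for w_ij, x_j in zip(row, x)]
            for row, y_i in zip(weights, y)]

# Three input units, two output units (illustrative values).
w0 = [[0.0, 0.0, 0.0],
      [0.0, 0.0, 0.0]]
x = [1.0, 0.0, 0.5]
y = [0.2, 1.0]
w1 = hebbian_update(w0, x, y, alpha=0.1)
print(w1)
```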


During the computation phase, the shutter is closed to prevent learning as shown
schematically in Figure 15.28. The array of sources is imaged onto SLM2 as a set of
readout beams, so that each individual source corresponds to one pixel (neuron unit) on
the SLM. The SLM modulates each beam so that the transmitted beam has an intensity
proportional to the output value of the corresponding neuron unit. Thus, the j-th source
illuminates the j-th pixel of this SLM, providing the signal x_j that becomes a reference
beam to read out the j-th hologram. This hologram reconstructs an array of spots, similar
to that depicted in Figure 15.24, that contribute to the input of each neuron unit in the
output plane. The optics is set up so that this array is imaged onto the detector array.
In the complete neural network architecture of Figure 15.26, additional optical elements
(mirror Mj, lens L4 and beamsplitter BS2) are used to displace the detector array plane
to the detector side of SLM1, providing the neuron unit activation potentials. In addition,
this beam is sent through beamsplitter BS2 to a subsequent layer in the next module or
to the output layer.


FIGURE 15.28 Photonic architecture for neural network implementation:
reconstruction configuration. This configuration implements a single forward
pass of the computing function in the photonic architecture of Figure 15.26.
The lower set of beams acts as a set of reference beams, and generates a set of
weighted output arrays that are imaged onto the detector array. Each stored
hologram is reconstructed by a single neuron unit x_j, and fans out with
appropriate weights to illuminate the detector array. The full set of reconstructed
holograms sums within each pixel to yield the neuron activation potentials {p_i}.


A generalized architecture that incorporates learning algorithms of the form of
Equation (24) is shown in Figure 15.29. Instead of SLM1, as in Figure 15.26, this architecture
utilizes a training term (δ_i) generator that is implemented via one or more optoelectronic
SLMs. In general, target values t_i, actual neuron unit outputs y_i, or possibly activation
potentials p_i may be provided as inputs to the training term generator. The physical
arrangement of optical beams passing through the training term generator (from left to right)
is the same as that shown passing through SLM1 in Figure 15.26. Lateral and feedback
connections can be incorporated by including an optical feedback path from the output of
the hologram to the input side of SLM2.


FIGURE 15.29 Generalized photonic architecture for neural network
implementation, including provision for the generation of arbitrary training terms
{δ_i}.


For many applications, both SLM1 and SLM2 can be fabricated using the same
technology. Let’s consider the case of a sigmoidal response with bipolar inputs and bipolar
outputs. The electronics within each neuron unit can take the difference between the two
detector inputs to yield the (bipolar) neuron potential. It can then perform the sigmoidal
nonlinearity, and send the result to appropriate (positive channel or negative channel)
modulator(s). For example, we have fabricated a number of silicon chips that integrate
the necessary control electronics with appropriate detectors. One possible circuit that has
been designed to incorporate the necessary functionality is shown schematically in Figure
15.30. Outputs from the two photodetection stages (V_in1 and V_in2) are differentially
amplified using two pairs of CMOS transistors (M1-M3, M2-M4), generating two separate
and complementary outputs. The differential amplifier has been designed to saturate for
large values of the input signal difference, producing the upper asymptotic limit behavior
characteristic of the sigmoid function. Each output signal is then inverted and clipped by
another CMOS transistor pair (M21-M22, M11-M12), which asymmetrizes the transfer
curve and adds the lower asymptotic limit of the sigmoid function. Finally, each output
signal is inverted yet again and shifted in level by a dual transistor sub-unity gain amplifier
stage (M23-M24, M13-M14), producing complementary output signals (V_out1 and V_out2)
that control the dual channel modulation elements. External provision is made in each
pixel (neuron unit) for the adjustment of the voltage offset (zero crossing point) of each
characteristic curve. This external bias adjustment allows for post-fabrication fine tuning
of the overall response of the circuit, given process-induced variations in device
characteristics.


FIGURE 15.30 Schematic diagram of a dual-input, dual-output differential
amplifier that effects a sigmoid-like transfer characteristic.
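
The function performed by each neuron unit (difference of two detector inputs, sigmoidal nonlinearity, routing to a positive or negative output channel) can be summarized in a short behavioral sketch. This is not a model of the CMOS circuit of Figure 15.30: the use of tanh as the sigmoid, the gain and offset parameters, and the rule that only one channel is active at a time are all assumptions made for illustration.

```python
import math

def neuron_unit(i_pos, i_neg, gain=1.0, offset=0.0):
    """Behavioral sketch of a dual-channel neuron unit.

    i_pos, i_neg: nonnegative detector inputs (positive and negative channels).
    Returns (out_pos, out_neg), the nonnegative drive levels for the positive
    and negative channel modulators.
    """
    potential = i_pos - i_neg                   # bipolar activation potential
    y = math.tanh(gain * (potential - offset))  # assumed sigmoid, range (-1, 1)
    return (max(y, 0.0), max(-y, 0.0))          # route by sign of the output

print(neuron_unit(2.0, 0.0))   # net positive input drives the positive channel
print(neuron_unit(0.0, 2.0))   # net negative input drives the negative channel
print(neuron_unit(1.0, 1.0))   # balanced inputs leave both channels dark
```

The offset parameter plays the role of the external zero-crossing adjustment described above.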


A sample set of characteristic curves measured from one of these chips is shown in
Figure 15.31, with the voltage on the second detector input channel as the parameter. These
curves show the differential function of the dual channel circuit, as well as the desired
sigmoidal response characteristic. A 6 x 6 array of these 100 μm x 100 μm neuron units has
also been fabricated with excellent uniformity.


FIGURE 15.31 Experimentally obtained transfer characteristics from the
circuit shown in Figure 15.30, showing the output voltage in both channels
(V_out1 and V_out2) as a function of one input voltage (V_in1), with the other input
voltage (V_in2) as a parameter.


The modular nature of this photonic neural net architecture can be inferred from
Figure 15.29; the upper right SLM is SLM2 of the subsequent module. Feedback paths from
one module back to previous modules can be added, if desired, in a relatively
straightforward manner. Bipolar signals are incorporated by the dual channel nature of the SLMs,
with positive and negative channels for each neuron unit. Since each neuron unit has two
physical outputs and two physical inputs, each interconnection between two neuron units
physically consists of four separate weighted connections (positive modulator to positive
detector, positive modulator to negative detector, etc.). Thus, even though each
physical weight is nonnegative in value, their combination permits effective implementation of
bipolar functional weights. In fact, the extra degrees of freedom provided by four
independent weights can require revised weight update rules to ensure convergence of the learning
process [Petrisor, 1990].
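
One way to see how four nonnegative physical weights can realize a single bipolar functional weight is the sketch below. The particular encoding (like-sign channels carry the positive part of the weight, cross-sign channels carry the negative part) is an assumption chosen for simplicity, not necessarily the scheme used in the hardware or in [Petrisor, 1990]; the key names "pp", "pn", "np", and "nn" denote modulator-to-detector channel pairs and are hypothetical.

```python
def split_weight(w):
    """Encode a bipolar weight w as four nonnegative physical weights.

    Keys name (modulator channel, detector channel): "pn" is the connection
    from the positive modulator to the negative detector, and so on.
    """
    pos, neg = max(w, 0.0), max(-w, 0.0)
    return {"pp": pos, "nn": pos, "pn": neg, "np": neg}

def potential(phys, x_pos, x_neg):
    """Bipolar potential produced by dual-channel signal (x_pos, x_neg)."""
    d_pos = phys["pp"] * x_pos + phys["np"] * x_neg   # positive detector
    d_neg = phys["pn"] * x_pos + phys["nn"] * x_neg   # negative detector
    return d_pos - d_neg                              # electronic difference

phys = split_weight(-0.5)
print(phys)
print(potential(phys, 0.8, 0.0))   # signal +0.8 through weight -0.5
print(potential(phys, 0.0, 0.3))   # signal -0.3 through weight -0.5
```

In this encoding the potential always equals w times (x_pos - x_neg); other combinations of the four weights give the same functional weight, which is the source of the extra degrees of freedom noted above.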

With the current spatial light modulator design at 100 μm x 100 μm per neuron unit,
10^4 neuron units per cm^2 can be implemented on each SLM. By constructing an SLM as
a mosaic of such arrays, a 3 in x 3 in SLM could implement approximately 5 x 10^5 neuron
units. Each “tile” or small array within such a mosaic need not be carefully aligned with
respect to the other tiles, as the optical system just images the array back onto itself; in the
case of multiple modules or lateral/feedback connections, there is, however, a requirement
that all SLMs are similarly tiled, within an appropriate tolerance figure. Note that the
current design utilizes only 2 μm feature sizes in CMOS; this could eventually be scaled
down by a factor of 4 in each dimension, yielding more than an order of magnitude increase
in the number of neuron units implementable per unit chip area (or an equivalent reduction
in the overall size with the same number of neuron units).
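
These density figures follow from simple area arithmetic, sketched below. The function name is illustrative, and the calculation assumes square pixels that tile the full SLM area with no allowance for gaps between tiles.

```python
def units_per_cm2(pitch_um):
    """Neuron units per square centimeter for a square pixel of given pitch."""
    pitch_cm = pitch_um * 1e-4          # 1 um = 1e-4 cm
    return 1.0 / (pitch_cm ** 2)

density = units_per_cm2(100.0)          # 100 um x 100 um neuron units
side_cm = 3 * 2.54                      # 3 in on a side
total_units = density * side_cm ** 2    # units on a 3 in x 3 in mosaic
scaling_gain = units_per_cm2(25.0) / density   # linear shrink by a factor of 4

print(density)        # about 1e4 units per cm^2
print(total_units)    # roughly 5.8e5 units
print(scaling_gain)   # about 16, i.e. more than one order of magnitude
```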

It should now be clear that this architecture can be generalized to implement certain
other neural models. The use of electronic circuitry for the neuron unit function and
training term generation provides significant inherent flexibility. For example, we have
completed preliminary designs of units for forward and backward propagating signals in
a backpropagation-style multilayer neural network. Although the optical weight updates
in the holographic medium are restricted to outer-product terms (Equation (24)) in the
architecture as shown, variants of the architecture may permit other learning scenarios.


Finally, we consider the important question of making duplicates of a network that
has already been trained. Since a volume hologram may store on the order of 10^10
independent weighted interconnections, the preferred technique is to make direct copies of
the multiplexed volume hologram. Here we describe a technique for copying such a
multiplexed volume hologram in one step [Jenkins, 1990c]. To our knowledge this has never
previously been achieved, but the use of incoherent/coherent holographic recording and
reconstruction makes this in principle quite straightforward. Figure 15.32 shows an optical
setup for duplicating the hologram. The master hologram is illuminated with the same
set of reference beams as those employed during exposure; all of the mutually incoherent
reference beams illuminate the master volume hologram simultaneously, recalling all of the
stored holograms in parallel. The source array is imaged so that it generates an identical
set of reference beams on the secondary (copy) holographic medium. Similarly, the
reconstructed object beams are also imaged, so that they are incident on the secondary
holographic medium, with amplitude and phase identical to that during recording of the
master hologram. The appropriate pairs of beams interfere in the secondary holographic
medium, making a complete copy of the original hologram. (As shown in Figure 15.32, the
copy will actually be a spatially inverted version of the original. A slight variant of the
optical system depicted in the figure can produce a copy that is identical to the original.)
Thus it is conceivable to mass produce copies of a previously trained interconnection
pattern, without ever knowing exactly what the interconnection weights are.


FIGURE 15.32 Optical layout for copying the entire contents of a
three-dimensional volume holographic optical element (VHOE) into a second VHOE,
utilizing a two-dimensional array of individually coherent, but mutually
incoherent sources.


We conclude this section with a brief summary of the current implementation status
of this particular photonic approach to neural network fabrication. For the neuron unit
arrays, 6 x 6 arrays of dual-channel detectors integrated with neuron function electronics
have been fabricated in silicon and operate correctly. Individual multiple quantum well
(InGaAs/GaAs) modulators have been successfully fabricated and tested, and exhibit
drive voltages compatible with the electronics. The novel doubly angularly multiplexed
incoherent/coherent interconnection technique has been tested experimentally at the level of
two inputs/two outputs, and simulated at the level of four inputs/four outputs, all with very
favorable results [Jenkins, 1990c; Asthana, 1990b; Asthana, 1990c]. In addition, several
learning algorithms that incorporate some of the unique features of the optical hardware
have been successfully designed and simulated. Large 2-D arrays of laser diodes that are
not mutually coherent have been fabricated recently [Jewell, 1990; Orenstein, 1990a; Von
Lehmen, 1990]. Photorefractive crystals are routinely grown commercially, and can be
purchased from vendors for use at visible as well as infrared wavelengths. In addition,
the basic requisite features of the doubly angularly multiplexed incoherent/coherent
holographic recording techniques have been demonstrated in single crystals of bismuth silicon
oxide (Bi12SiO20), though not as yet at infrared wavelengths compatible with both the
laser diode source array and the multiple quantum well spatial light modulators. All of the
other components in the architecture (lenses, beamsplitters, etc.) are essentially available
off the shelf.

As with any research project in progress, several questions pertaining to the photonic
approach outlined herein remain partially unanswered. Consider, for example, the
incoherent/coherent source array. Given the current state of the art of laser diode arrays, the
total power dissipation will limit the number, maximum optical power, and spacing of
the individual sources. Cross-coherence among the sources can cause undesirable crosstalk
among corresponding interconnections, although in some neural network models a small to
moderate degree of interconnection crosstalk is not likely to cause intolerable degradation
in performance. Fortunately, a larger spacing of sources implies that each laser can output
a higher power, and also assures a higher degree of mutual incoherence. Other remaining
questions include the achievable contrast ratio and uniformity of the spatial light
modulators; suitable monolithic or hybrid techniques for integrating detectors, electronics and
modulators; optimization of the learning algorithm relative to the chosen holographic
material’s storage and erasure time constants; and linearity and limitations of the hologram
copying process. The next section discusses fundamental and technological limitations of
the photonic hardware and their impact on the performance of photonic neural network
architectures.

FUNDAMENTAL PHYSICAL AND TECHNOLOGICAL LIMITATIONS OF NEURO-OPTICAL COMPUTATION

Even though we are relatively early on in the development of viable neuro-optical
computing systems, it is not too early to begin asking questions about the ultimate boundaries
that may impact our future achievements. This line of inquiry can have a two-fold
impact. First, discovery of inherently fundamental physical limitations that affect all forms
of computation can, if correctly applied to the neural computational paradigm, both
provide us with an ultimate goal worthy of achievement, and perhaps warn us in advance of
architectural choices that will prove unworthy of technological implementation. Second,
careful analysis of the technological limitations (device performance boundaries within a
given technological implementation) that affect system performance can provide us with
necessary guidance in choosing among many possible implementation strategies. The goal,
of course, is to come up with the right combination of implementation strategy and
technological choices to achieve the highest computational throughput (or perhaps learning
rate) based on any one of a number of metrics. In this section, then, we discuss both the
fundamental physical and technological limitations that impact the future performance of
neuro-optical computational systems.




The  Energy  Metric 

Your brain is truly a remarkable instrument from a computational point of view (as well
as from many other points of view!). Although estimates (as well as individuals!) vary, it
is thought that your brain consists of about 10^11 neurons, each interconnected (in certain
regions of the brain) to 10^3 - 10^4 other neurons [Changeux, 1985; Dowling, 1987; Hubel,
1979]. The human brain exhibits both short and long term memory, performs
sophisticated image analysis in fractions of a second, operates as an effective associative memory
integrated over a whole lifetime of learning, and yet operates on a power budget that is
only a fraction of the power dissipated by the average light bulb in your home [Iversen,
1979]. In order to accomplish this, the active switching elements, the neurons, operate
at an average power level about seven orders of magnitude lower than that characteristic
of VLSI logic circuits [Mead, 1989b]. If this were not possible, it’s likely that you’d be
running a temperature even without the flu!

This discussion points to one of many possible metrics by which computational systems
can be judged: energy (or power) dissipation. In fact, many modern supercomputers are
limited in performance precisely because of power dissipation boundaries, or the ability to
extract the heat generated by the computational process from the volume used to perform
the work. We can perhaps think of computation as broken down into three fundamental
parts: representation of information, implementation of computational complexity, and
detection of the results. From the energy metric point of view, everything costs energy:
what goes in costs energy, what comes out costs energy, and what goes on in between costs
energy too. The trick in building the computational engines of the future (neural or
otherwise) will be to maximize the overall performance with a minimum expenditure of energy.

Some  Quantum  Limitations 

By representation of information, we mean the choice of data representation on which
computations are performed. Some examples might include the binary representation,
M-ary representations, an analog representation, or the residue representation [Huang, 1979].
This choice has implications at the fundamental level for the energy cost to represent a
number within a given probability of error. For example, if we detect an optical signal
bit that is binary encoded with a so-called “ideal” detector that can tell the difference
between receiving exactly zero photons and one or more photons, it only takes ten photons
on the average to guarantee that the signal is received with a probability of error of one
part in a billion, or a “bit error rate (BER)” of 10^-9. The average photon at a typical
optical communications wavelength of 1300 nm has an energy of only 1.5 x 10^-19 joules, so
the total energy cost per bit is 1.5 attojoules (1.5 x 10^-18 joules). For a communications
channel operating at 10 gigabytes (8 x 10^10 bits) per second, this implies a power dissipation
due to representation cost alone (without worrying yet about the transmission or detection
of the information) of only 0.12 microwatts. For currently available detectors, about a
thousand photons are required to achieve the same BER, so the necessary representation
cost increases to 12 microwatts. In most currently envisioned communications systems,
this cost is overwhelmed by other factors.
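
The arithmetic behind these figures can be checked with a short sketch. It assumes on-off keying with equiprobable bits and Poisson photon statistics, a standard model that reproduces the ten-photon figure but is not spelled out in the text; the function names are illustrative.

```python
import math

H = 6.62607015e-34    # Planck constant, J s
C = 2.99792458e8      # speed of light, m/s

def photon_energy(wavelength_m):
    """Energy of a single photon in joules."""
    return H * C / wavelength_m

def ideal_photons_per_bit(ber):
    """Average photons per bit for an ideal photon-counting detector.

    Assumed model: a "0" sends no light and a "1" sends a Poisson pulse of
    mean n1; the only error is a "1" that yields zero counts, so
    BER = 0.5 * exp(-n1). The average over both bit values is n1 / 2.
    """
    n_one = -math.log(2.0 * ber)
    return n_one / 2.0

def representation_power(photons_per_bit, wavelength_m, bit_rate):
    """Optical power (watts) spent purely on representing the bits."""
    return photons_per_bit * photon_energy(wavelength_m) * bit_rate

print(photon_energy(1300e-9))                     # about 1.5e-19 J
print(ideal_photons_per_bit(1e-9))                # about 10 photons
print(representation_power(10, 1300e-9, 8e10))    # about 0.12 microwatts
print(representation_power(1000, 1300e-9, 8e10))  # about 12 microwatts
```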

But what if we chose to represent numbers in an analog representation instead? If we
were to follow the same kinds of quantum statistical rules, we would find that to represent
the number “1000”, say, with an effective bit error rate of 10^-9 requires about 150 million
photons [Tanguay, 1988]. This is about 15 million times larger than the representation cost
of a single binary bit, and about 1.5 million times larger than the binary representation
cost of the number 1000.
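
The two ratios quoted above can be reproduced directly, taking the 150 million photon figure from the text as given and assuming that the number 1000 occupies ten binary digits at ten photons each. The variable names are illustrative.

```python
import math

analog_photons = 150e6     # photons to represent "1000" in analog form (from the text)
photons_per_bit = 10       # ideal-detector cost of one binary bit

bits_for_1000 = math.ceil(math.log2(1000 + 1))      # binary digits needed for 0..1000
binary_photons = photons_per_bit * bits_for_1000    # photons for 1000 in binary

ratio_to_single_bit = analog_photons / photons_per_bit
ratio_to_binary_number = analog_photons / binary_photons

print(bits_for_1000)            # 10 bits
print(ratio_to_single_bit)      # 15 million
print(ratio_to_binary_number)   # 1.5 million
```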

If  we  assume  that  the  analog  representation  need  only  cover  numbers  between  0  and 
1000,  then  the  dependence  of  the  probability  of  error  on  the  number  of  photons  used  to 
represent  the  highest  number  (1000)  is  given  in  Figure  15.33.  Interestingly,  even  if  we  are 
willing  to  give  up  on  a  couple  of  orders  of  magnitude  of  error  probability,  our  energy  cost 
isn’t  reduced  very  much.  In  fact,  it  costs  about  27  million  photons  to  represent  1000  with 
1%  error,  and  about  11  million  photons  to  represent  it  with  as  much  as  10%  error.  These 
numbers  can  all  be  reduced  by  about  two  orders  of  magnitude  if  we  are  willing  to  give  up 
a  factor  of  ten  in  dynamic  range,  limiting  the  highest  representable  number  to  100  instead 
of  1000,  as  shown  in  the  Figure. 


FIGURE 15.33 The single pixel probability of error P(Err) as a function
of the number of photons detected within each pixel, for the cases of 100 and
1000 analog grey levels.


These errors arise fundamentally from the quantum statistical nature of light, and from
the fact that we just can’t guarantee the number of photons in a packet of light (without
resorting to exotic things like “squeezed states”, which have their own practical
limitations as well as other costs). In the brain, of course, it is currently thought that many (but
not all) of the quantities involved in signal transmission, both electrical and chemical, are
analog in nature.


The  Incorporation  of  Computational  Complexity 

Given  the  fact  that  it  is  considerably  more  expensive  to  represent  quantities  in  analog 
as  opposed  to  binary  form,  why  don’t  we  always  choose  to  compute  in  the  binary  repre¬ 
sentation?  The  answer  is  that  many  operations  are  less  energy  consumptive  to  perform 
in  the  binary  representation,  but  others  are  not.  The  difference  lies  in  the  degree  of
computational  complexity  that  can  be  implemented  on  a  given  representation  for  a  particular
computational  operation  within  a  chosen  technological  implementation.  For  our  purposes 
here,  we  may  define  the  computational  complexity  of  a  given  operation  as  the  minimum 
number  of  irreducible  binary  bit  operations  (over  all  possible  computational  algorithms 
and  machine  architectures)  required  to  complete  the  calculation  assuming  that  the  data  is 
represented  in  binary  throughout. 

For  operations  of  low  computational  complexity  such  as  transferring  data  from  the  CPU 
to  memory  or  logic  and  control  operations,  computation  in  the  binary  representation  tends 
to  have  a  significant  energy  consumption  advantage  at  the  fundamental  limits  (as  well  as 
at  the  current  technological  limits  for  both  electronic  and  photonic  processors).  On  the 
other  hand,  for  operations  of  high  computational  complexity  such  as  the  two-dimensional 
Fourier  transform  that  require  a  very  large  number  of  irreducible  binary  operations  to
perform  (for  the  optimum  algorithm),  computation  in  the  analog  representation  tends  to
exhibit  lower  overall  energy  consumption,  particularly  in  photonic  implementations.


The  Hybrid  Representation  Concept 

In  the  case  of  neural  networks,  a  number  of  characteristic  types  of  computational  op¬ 
erations  are  typically  employed,  including  for  example  the  calculation  of  weight  updates 
and  storage  of  updated  weights,  the  fan-out  of  neuron  unit  outputs,  the  multiplication 
of  fanned-out  outputs  by  weights,  the  communication  of  weighted  signals,  the  fan-in  and 
summation  (or  differencing)  of  weighted  inputs,  and  the  thresholding  of  summed  inputs  to 
form  neuron  unit  outputs.  These  operations  span  a  wide  gamut  of  computational  as  well  as 
physical  complexity.  As  such,  we  suggest  that  optimum  neural  network  performance  from 
an  energy  metric  viewpoint  may  turn  out  to  be  best  achieved  with  a  hybrid  representation, 
in  which  the  signal  representation  is  essentially  binary  for  certain  functions,  and  essentially 
analog  for  others. 

An  example  of  the  use  of  this  hybrid  representation  concept  is  the  use  of  a  hard  thresh¬ 
old  function  within  each  neuron  unit  to  create  a  two  state  output  (on  and  off),  and  the  use 
of  analog  holographic  storage  for  all  interconnection  weights,  as  described  in  a  previous 
section.  In  this  case,  the  inputs  to  each  neuron  unit  are  analog,  while  the  outputs  from  each 
neuron  unit  are  binary.  Multiplications  are  performed  in  a  fully  hybridized  representation 
(multiplicands  are  analog  while  multipliers  are  binary),  but  summations  are  fully  analog
(the  superposition  of  fanned-in  input  intensities).  Given  a  particular  choice  of  implemen¬ 
tation  technology,  then,  the  overall  power  budget  for  a  given  hybrid  representation  can 
be  established  and  compared  with  similar  power  budgets  for  fully  analog  and  fully  binary 
representations. 
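As a minimal illustration of the hybrid representation described above, the following Python sketch models one layer in which binary neuron outputs gate analog weights, the gated values are summed in analog form, and a hard threshold produces the next set of binary outputs. It is only a numerical analogy for the optical system: the function name, the example weights, and the thresholds are hypothetical, and the weights are taken to be non-negative as they would be for summed intensities.

```python
from typing import List, Sequence


def hybrid_layer(
    weights: Sequence[Sequence[float]],
    binary_inputs: Sequence[int],
    thresholds: Sequence[float],
) -> List[int]:
    """Compute binary neuron outputs from binary inputs and analog weights."""
    outputs = []
    for row, threshold in zip(weights, thresholds):
        # Hybrid multiplication: a binary multiplier (0 or 1) gates an analog weight.
        weighted = [w if x else 0.0 for w, x in zip(row, binary_inputs)]
        # Fully analog summation, standing in for superposed input intensities.
        summed = sum(weighted)
        # Hard threshold restores a two-state (on/off) output.
        outputs.append(1 if summed >= threshold else 0)
    return outputs


if __name__ == "__main__":
    example_weights = [[0.2, 0.7, 0.4], [0.9, 0.1, 0.3]]  # hypothetical analog weights
    print(hybrid_layer(example_weights, [1, 0, 1], [0.5, 1.5]))
```

Because the multiplier is binary, each multiplication reduces to passing or blocking a stored analog weight, which is why this step is cheap in the hybrid scheme.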


The  Inherent  Costs  of  Interconnections 


Regardless  of  the  representation  chosen,  it  is  clear  that  any  computational  energy  budget
must  take  into  account  the  non-negligible  cost  of  the  interconnections  themselves.
Interconnections  characteristic  of  neural  networks  are  merely  a  form  of  weighted  commu¬ 
nication  channels,  characterized  by  a  high  degree  of  fan-out  and  fan-in.  For  both  electronic 
and  photonic  implementations  that  have  adaptive  weights,  it  takes  energy  to  calculate  the
weight  updates,  it  takes  energy  to  store  the  resultant  updated  weights,  it  takes  energy  to 
perform  the  multiplications  implied  by  the  weighting  of  the  output  signals,  and  it  takes 
energy  to  communicate  the  various  signals  between  layers. 

In  the  electronics  case,  these  energy  costs  derive  from  charging  up  the  capacitances  of 
switching  devices  in  the  various  forms  of  memory,  flipping  switches  in  the  various  arith¬ 
metic  operations  (both  addition  and  multiplication)  for  binary  representations,  operating 
linear  and  nonlinear  devices  for  analog  representations,  and  charging  and  discharging  the 
capacitance  associated  with  output  line  drivers  as  well  as  the  capacitance  associated  with 
the  physical  interconnections  (wires)  among  the  various  parts  of  the  circuit.  In  the  pho¬ 
tonics  case,  the  comparable  energy  costs  derive  from  the  generation  of  light  by  coherent 
optical  sources,  the  holographic  recording  of  weights  and  weight  updates,  the  throughput 
losses  engendered  by  readout  of  the  holographically  stored  (and  multiplexed)  interconnec¬ 
tion  matrix,  any  throughput  losses  associated  with  the  fan-out  and  fan-in  processes,  and 
the  inter-  and  intralayer  communication  costs.

The  bottom  line  is  that  in  most  cases,  complex,  highly  multiplexed  interconnection 
networks  with  a  high  degree  of  fan-in  and  fan-out  are  very  energy  consumptive,  and  for 
the  neural  network  case  may  prove  to  be  the  largest  energy  sink.


The  Inherent  Cost  of  Detection  (Switching) 

No  useful  computational  system  can  avoid  the  costs  of  detection,  both  of  intermediate 
results  that  are  essential  to  following  calculations,  and  of  the  sought-after  answers  or  output 
states  that  initiate  subsequent  actions  or  analysis.  And  this  is  the  cost  above  all  costs  that 
we  are  certainly  willing  to  pay,  as  answers  or  outputs  validate  the  usefulness  of  the  system 
and  its  design.  As  pointed  out  earlier,  there  are  three  primary  areas  in  which  detections
are  essential:  in  the  generation  of  summed  inputs  prior  to  functional  transformation  into
individual  neuron  unit  outputs  (usually  on  the  input  side  of  an  optically  addressed  spatial 
light  modulator),  in  the  generation  of  specific  system  outputs,  and  in  the  holographic 
recording  of  interconnection  weights. 

The  physical  process  of  detection  inherently  involves  the  dissipation  of  finite  energy  (if 
accomplished  in  finite  time  within  prescribed  uncertainties,  as  is  appropriate  for  compu¬ 
tation),  since  it  necessarily  involves  the  irreversible  switching  of  the  state  of  a  physical 
component,  as  well  as  the  guarantee  that  the  switched  state  will  be  maintained  over  the 
time  period  of  measurement  without  fluctuations  due,  for  example,  to  thermally  induced 
statistical  variations.  This  is  particularly  true  in  highly  distributed  computing  systems 
such  as  neural  networks  that  depend  on  a  certain  degree  of  predictability  and  synchro¬ 
nization  of  communicated  results  for  progressive  computation.  As  such  this  is  not  really 
a  fundamental  physical  limitation,  but  rather  is  a  technological  limitation  imposed  by  the 
system  designer,  who  would  really  like  to  see  some  intelligible  output  from  the  system  in 
the  near  future. 


Optimization  of  the  Computational  Architecture 

The  design  of  a  computational  architecture  in  many  ways  fixes  the  fundamental  perfor¬ 
mance  limitations  of  the  system,  as  choices  must  be  made  about  the  representation  of  data 
within  the  architecture,  the  methods  employed  for  the  implementation  of  computational 
complexity,  and  the  frequency  and  nature  of  the  detections  required  for  both  intermediate 
and  final  results.  For  neural  networks  capable  of  sophisticated  operations,  optimization 
of  the  computational  architecture  against  one  or  more  metrics  (such  as  total  energy  cost 
for  a  given  computation,  or  total  power  at  a  given  operating  frequency)  will  necessitate 
an  appropriate  balance  among  the  various  representations  employed,  as  well  as  among
the  physical  mechanisms  employed  to  accomplish  the  necessary  computations.  A  further 
balance  must  be  struck  between  the  fraction  of  the  computational  burden  that  is  assigned 
to  interconnections,  and  the  fraction  that  is  accomplished  by  switching  (whether  logic,
arithmetic,  or  detection  of  results).

For  many  classes  of  computational  problems,  the  neural  network  paradigm  may  prove
to  be  nearly  optimal  even  in  the  regime  in  which  all  of  the  individual  components  are
assumed  to  be  operating  at  their  respective  fundamental  physical  performance  boundaries.
Relative  to  a  modern  digital  supercomputer,  certainly,  neural  networks  seemingly
offer  an  unusual  mix  of  hybrid  representations  (primarily  analog),  interconnections  (highly
multiplexed  as  well  as  weighted),  and  switching  (infrequent  relative  to  the  rate  at  which
interconnections  are  utilized).  It  is  at  the  very  least  intriguing  to  imagine  whether  or  not
our  biological  heritage  has  stored  within  it  a  useful  clue  about  highly  efficient  computation
for  truly  sophisticated  problems.

Technological  Limitations 


The  choice  of  a  technological  base  (or  bases)  within  which  to  design  a  neural  network
with  a  large  number  of  neuron  units  and  a  high  degree  of  connectivity  implies  yet  another
set  of  performance  constraints  above  and  beyond  the  fundamental  physical  boundaries
referred  to  above.  These  technological  limitations  may  not  yet  have  been  reached  within
the  development  of  a  given  technology,  but  can  at  least  be  estimated  given  what  we
know  about  the  physics  of  operation  of  the  devices  in  question.  One  such  technological
limitation,  for  example,  governs  the  total  energy  dissipation  density  that  can  be  tolerated
on  a  given  semiconductor  substrate  without  either  an  unacceptable  temperature  rise  that
affects  device  performance,  or  resort  to  extraordinary  cooling  measures  (that  may  prove  to
be  unfeasible  in  an  optical  path).  As  a  second  example,  the  energy  required  to  represent
a  single  bit  using  current  digital  logic  circuits  integrated  in  silicon  is  about  seven  orders  of
magnitude  above  the  thermal  fluctuation  limit  [Tanguay,  1988].
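To give a feel for the scale implied by "seven orders of magnitude," the following Python sketch converts that ratio into an absolute energy. It assumes, purely for illustration, that the thermal fluctuation limit is of order kT at a temperature of 300 K; the source does not state which definition or temperature is intended, and the function name is hypothetical.

```python
BOLTZMANN_J_PER_K = 1.380649e-23  # Boltzmann constant (exact SI value)


def bit_energy_joules(orders_above_thermal: float, temperature_k: float = 300.0) -> float:
    """Energy per bit lying a given number of decades above kT.

    Assumption: the thermal fluctuation limit is taken to be kT, which is one
    common convention but is not confirmed by the source.
    """
    thermal_energy = BOLTZMANN_J_PER_K * temperature_k
    return thermal_energy * 10.0 ** orders_above_thermal


if __name__ == "__main__":
    energy = bit_energy_joules(7)
    print(f"Energy per bit: about {energy * 1e15:.0f} femtojoules")
```

Under this assumption the energy per bit comes out at a few tens of femtojoules.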

In  a  previous  section  of  this  chapter,  we  discussed  a  particular  photonic  implementation 
strategy  for  neural  networks  that  involved  specific  technological  choices  for  the  various 
types  of  components  required  by  the  architecture.  At  the  present  state  of  development  of 
photonic  computational  systems,  we  do  not  have  the  luxury  of  designing  everything  within
a  single  technological  base,  as  is  the  case  perhaps  for  computational  subsystems  based  on
VLSI  chips.  “Optical  silicon”  has  not  yet  emerged,  or  at  the  very  least  has  not  yet  been
identified  and  recognized  as  such,  even  though  numerous  candidates  have  been  intensively 
investigated. 

Perhaps  the  leading  candidate  at  the  current  time  is  the  compound  semiconductor
system  based  on  gallium  arsenide  (GaAs)  and  including  related  ternary  compounds  such
as  indium  gallium  arsenide  (In$_x$Ga$_{1-x}$As)  and  aluminum  gallium  arsenide  (Al$_x$Ga$_{1-x}$As).
Within  this  system,  at  least,  sources,  source  arrays,  spatial  light  modulators,  integrated
electronic  circuitry,  volume  holographic  optical  elements,  detectors,  and  detector  arrays
have  all  been  fabricated  and  evaluated  with  varying  degrees  of  success.  What  has  not
been  established  to  date  is  the  mutual  compatibility  of  all  of  these  elements  operating
within  a  given  systems  context.  This  demonstration  of  mutual  compatibility  in  all  relevant
performance  specifications  is  essential,  because  in  a  highly  interconnected  system
the  overall  performance  achieved  is  often  most  strongly  influenced  by  the  component  with
the  least  desirable  characteristics.  An  obvious  example  is  that  of  a  single  channel  optical
communications  link,  for  which  the  transmission  bandwidth  will  be  delimited  by  the
lowest  bandwidth  component  among  the  source/modulator,  transmission  medium,  and
detector/amplifier.

In  the  remainder  of  this  section,  we  briefly  discuss  a  number  of  the  types  of  technological
limitations  that  will  impact  the  performance  of  currently  envisioned  photonic
implementations  of  neural  networks.

With  regard  to  sources  and  source  arrays,  the  principal  technological  issues  are  the 
minimization  of  laser  thresholds  to  allow  for  parallel  operation  of  a  large  number  of  sources
on  a  single  chip,  the  coherence  length  achievable  with  ultrashort  cavity  surface  emitting
lasers  when  fabricated  in  an  array  (which  impacts  the  holographic  recording  process),  the
uniformity  of  wavelength  across  the  array  (particularly  for  parallel  readout  of  wavelength
sensitive  devices  such  as  multiple  quantum  well  spatial  light  modulators),  and  both  the
short  term  (process-determined)  yield  and  the  long  term  reliability  of  individual  sources
within  a  large  scale  array. 

For  the  case  of  spatial  light  modulators,  key  technological  issues  include  the  maximum
density  of  neuron  units  that  can  be  integrated  on  a  monolithic  or  hybrid  chip  with  appropriate
detectors,  control  circuitry,  and  modulators  within  each  pixel;  the  sensitivity  to
input  intensity;  the  neural  unit  functionality  (and  perhaps  programmability)  that  can  be
achieved  at  the  minimum  cost  in  real  estate  and  energy  dissipation;  the  contrast  ratio  and
uniformity  of  the  contrast  ratio  across  the  array  of  pixels  (neuron  units);  the  achievable
dynamic  range  of  the  input/output  transfer  function;  and  the  operational  bandwidth  that
can  be  reached  assuming  a  50%  duty  cycle  for  each  neuron  unit  (which  determines  the
total  power  dissipation  of  the  chip).

The  high  degree  of  interconnectivity  envisioned  for  photonic  implementations  of  neural 
networks  hinges  primarily  on  the  achievement  of  appropriate  functionality  in  the  volume 
holographic  optical  elements  used  to  record  and  store  interconnection  weights,  produce 
fan-in  and  fan-out  from  each  neuron  unit,  and  allow  for  highly  parallel  readout  of  the 
weighted  interconnection  network.  Key  technological  limitations  for  currently  investigated 
photorefractive  materials  include  the  optical  quality  routinely  achievable  in  large  (1  cubic 
inch)  single  crystal  samples,  the  storage  capacity  of  the  medium  as  determined  by  the 
highest  spatial  frequency  gratings  that  can  be  recorded,  the  sensitivity  for  recording  of 
updated  weights  [Johnson,  1988]  at  the  source  wavelength  (which  in  turn  determines  the 
source  power  necessary  to  initiate  weight  updates  during  the  learning  phase),  the  poten¬ 
tial  for  “fixing”  of  the  stored  interconnection  weights  to  allow  for  nondestructive  readout 
during  computation,  and  the  capability  of  copying  the  contents  of  the  stored  interconnec¬ 
tion  matrix  into  another  holographic  medium  (in  order  to  provide  the  capacity  for  mass 
production  of  fixed  pattern  interconnections  following  a  training  sequence  executed  with  a 
dynamic  medium).  Perhaps  the  most  important  technological  limitation  of  a  given  holo¬ 
graphic  recording  medium  will  prove  to  be  the  total  number  of  weight  update  cycles  that 
can  be  initiated  without  complete  erasure  of  the  weight  updates  recorded  during  the  very 
first  training  cycle.  This  number  in  effect  sets  an  upper  bound  on  the  learning  capacity  of 
the  photonic  neural  network. 

The  primary  technological  limitations  of  importance  to  single  pixel  detectors  as  used, 
for  example,  on  the  input  side  of  optically  addressed  spatial  light  modulators,  include  the
sensitivity  of  the  detector/amplifier  combination  (which  together  with  the  spatial  light
modulator  gain  determines  the  overall  loop  gain  for  a  single  computational  iteration),  the
modulation  bandwidth  in  conjunction  with  following  circuitry,  and  the  chip  area  required 
to  achieve  the  desired  sensitivity  and  bandwidth  tradeoff.  Depending  on  the  technological 
base  within  which  the  spatial  light  modulator  is  fabricated,  the  potential  for  integration 
with  control  circuitry  and  in  some  cases  the  modulation  elements  themselves  provides  an 
additional  constraint. 

For  detector  arrays,  many  of  the  same  issues  apply  with  the  additional  constraints  of 
uniformity  of  each  performance  parameter  across  the  array,  and  the  reliability  of  the  full 
array  of  pixels.  Another  important  technological  issue  is  the  frame  rate  for  readout  of  the 
entire  array  at  a  given  pixel  density,  which  is  determined  both  by  the  technological  base
within  which  the  array  is  fabricated,  and  by  the  physical  structure  of  the  array  and  its 
readout  configuration.  As  discussed  in  an  earlier  section,  charge-coupled-device  (CCD) 
arrays  are  typically  read  out  by  temporally  multiplexing  the  contents  of  a  full  frame  onto 
one  or  a  few  high  bandwidth  serial  outputs.  The  contents  of  each  row  of  stored  charge 
packets  generated  during  the  exposure  cycle  are  shifted  out  to  a  high  speed  serial  readout
buffer,  which  reads  out  one  entire  column  of  pixels  in  between  each  lateral  shift  of  the  rows 
as  shown  schematically  in  Figure  15.34. 


FIGURE  15.34  Illustration  of  parallel-to-serial  conversion  in  two-dimensional
detector  arrays  such  as  the  charge-coupled-device  (CCD)  array.  Charge  accumulated
within  each  photosensitive  region  during  exposure  is  transferred  laterally
by  a  set  of  row-parallel  shift  registers  to  a  high  speed  parallel-to-serial
shift  register,  which  reads  out  the  entire  array  one  column  at  a  time.


This  parallel-to-serial  conversion  limits  the  frame  readout  rate  to  the  maximum  serial 
transfer  rate  achievable  in  the  readout  buffer,  divided  by  the  number  of  pixels  in  the  array. 
For  example,  if  the  array  is  2048  x  2048  pixels  in  size  with  a  readout  buffer  operating  at 
200  MHz,  the  frame  rate  will  be  limited  to  about  48  frames  per  second.  For  many  neural
network  applications,  this  frame  rate  may  be  more  than  sufficient  for  access  to  the  desired
outputs  (including  the  time  required  for  temporal  demultiplexing  of  the  output).  In  cases 
that  demand  higher  frame  rates,  the  array  can  be  segmented  so  that  multiple  readout
buffers  can  be  used,  each  accessing  a  fraction  of  the  total  number  of  rows  in  the  array. 
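The readout limit described above reduces to a one-line estimate. The following Python sketch assumes that every readout buffer serves an equal share of the pixels and that there is no overhead for row shifts, blanking, or demultiplexing; the function name and parameters are hypothetical.

```python
def ccd_frame_rate(rows: int, cols: int, serial_rate_hz: float, readout_buffers: int = 1) -> float:
    """Upper bound on frames per second for a serially read detector array.

    Assumes each readout buffer handles an equal fraction of the pixels and
    that readout overhead is negligible (an idealization).
    """
    if readout_buffers < 1:
        raise ValueError("at least one readout buffer is required")
    pixels_per_buffer = rows * cols / readout_buffers
    # Each buffer must clock out all of its pixels once per frame.
    return serial_rate_hz / pixels_per_buffer


if __name__ == "__main__":
    single = ccd_frame_rate(2048, 2048, 200e6)
    segmented = ccd_frame_rate(2048, 2048, 200e6, readout_buffers=4)
    print(f"one buffer: {single:.1f} frames/s, four buffers: {segmented:.1f} frames/s")
```

For a 2048 x 2048 array read through a single 200 MHz buffer, this simple model evaluates to a little under 48 frames per second, and segmenting the array across four buffers raises the limit fourfold.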

An  unusual  feature  of  identifying  the  technological  constraints  that  bound  any  neural 
network  implementation,  whether  it  be  electronic,  photonic,  chemical,  mechanical,  or  all 
of  the  above,  is  the  fact  that  we  just  don’t  know  enough  yet  about  the  operation  of  highly 
interconnected  nonlinear  systems  of  large  dimension  to  fully  assess  the  impact  of  a  par¬ 
ticular  constraint  on  the  overall  system  operation.  One  example  is  the  degree  to  which 
nonuniformities  in  the  neuron  units  themselves  (e.g.,  in  their  sensitivity,  contrast  ratio,  or
overall  response  function)  or  in  the  interconnection  medium  can  be  tolerated  by  an  archi¬ 
tecture  that  is  to  a  large  extent  self-organizing.  A  second  example  is  the  necessity  within 
a  certain  neural  network  paradigm  of  implementing  precisely  the  right  nonlinearity  that 
translates  summed  neuron  inputs  to  neuron  outputs  (or,  for  that  matter,  the  very  existence
of  nonlinearities  in  the  recording  and  storage  of  weight  updates,  and  in  the  readout
of  the  full  weighted  interconnection  pattern).  Some  of  these  types  of  questions  may  be 
amenable  to  simulation,  but  in  other  cases  we  may  have  to  await  the  results  from  actual 
implementations  to  refine  our  understanding  of  the  technological  requirements. 


THE  FUTURE  OF  NEURO-OPTICAL  COMPUTATION 

A  wide  variety  of  photonic  architectures  and  components  are  currently  under  intensive 
investigation  for  neural  network  applications.  A  thorough  discussion  of  these  alternative 
strategies  and  their  strengths  and  weaknesses  is  unfortunately  beyond  the  scope  of  this 
chapter.  In  this  final  section,  we  address  the  future  prospects  of  neuro-optical  computation
from  the  point  of  view  of  those  critical  issues  that  are  common  to  all  such  proposed
implementation  strategies. 


The  Critical  Issues 


As  with  any  emerging  technological  breakthrough,  successful  advanced  development  of
photonic  neural  networks  will  require  that  the  technology  prove  to  be  manufacturable,  in
the  sense  that  it  is  amenable  to  mass  production  techniques  at  reasonable  cost;  flexible
in  design,  in  that  the  technological  base  provides  significant  degrees  of  freedom  for  architectural
and  functional  variations;  and  leveraged  as  much  as  possible  by  developments
in  related  technologies  that  can  offload  a  significant  fraction  of  the  development  time  and 
costs.  Implicit  in  these  three  key  features  is  the  issue  of  the  component  uniformity  that  can 
be  achieved  over  large  array  sizes,  and  the  related  issue  of  the  scalability  of  the  technology 
to  large  scale  systems  either  by  increases  in  the  basic  array  sizes  or  by  the  incorporation 
of  a  modular  design  from  the  outset. 

In  all  useful  computational  systems,  the  bottom  line  is  to  a  large  extent  determined 
by  the  maximum  amount  of  computational  throughput  capacity  that  can  be  squeezed 
into  the  smallest  system  volume  within  a  tolerable  energy  dissipation  constraint  or  power 
budget.  In  the  case  of  photonic  neural  networks  that  utilize  the  implementation  strategy 
outlined  in  a  previous  section  of  this  chapter,  the  two  most  important  factors  that  influence 
the  computational  throughput  capacity  are  the  storage  capacity  of  the  volume  holographic 
optical  element,  and  the  operational  bandwidth  of  the  neuron  units  (nonlinear  spatial  light 
modulators). 

The  storage  capacity  of  a  holographic  interconnection  medium  is  in  turn  determined 
by  the  maximum  number  of  independent  weighted  interconnections  that  can  be  recorded 
and  retrieved  per  unit  volume.  Although  we  discussed  this  issue  in  an  earlier  section  from
a  theoretical  viewpoint  (and  in  yet  another  section  from  an  architectural  viewpoint),  we
have  not  addressed  herein  the  even  more  important  question  of  the  actual  density  of  independent
interconnections  achievable  with  photorefractive  (or  other  photosensitive  real  time)
materials  that  contain  a  considerable  number  of  scattering  centers  and  exhibit  diffraction
efficiencies  that  depend  strongly  on  the  spatial  frequency  of  the  recorded  grating.  Demonstration
of  a  high  density  of  weighted  interconnections  with  low  interchannel  crosstalk  in  a
real  time  holographic  recording  medium  will  provide  a  benchmark  of  achievement  for  photonic
implementation  strategies,  as  well  as  a  metric  by  which  one  can  more  appropriately
estimate  eventual  system  performance.  Demonstration  of  parallel  weight  updates  at  high 
sensitivity  (requiring  tolerable  optical  source  intensities)  during  a  complete  training  cycle 
is  also  necessary  for  a  convincing  proof  of  system  feasibility. 

The  operational  bandwidth  of  two-dimensional  spatial  light  modulators  that  can  be 
used  as  neuron  unit  arrays  will  prove  to  be  orders  of  magnitude  larger  than  the  bandwidth 
characteristic  of  biological  systems.  Feasibility  analyses,  as  well  as  preliminary  device 
characterization  studies,  indicate  no  fundamental  or  technological  barriers  to  operation  of 
individual  neuron  units  at  bandwidths  exceeding  100  MHz  [Asthana,  1990b].  Switching  a
100  x  100  element  neuron  unit  array  at  100  MHz  with  a  50%  duty  cycle,  however,  generates
a  watt  of  power  dissipation  for  every  2  picojoules  of  switching  energy  required  by  an
individual  neuron  unit,  including  the  detector,  amplifier,  control  circuitry,  and  modulator.
Although  the  neuron  unit  arrays  in  most  neural  network  architectures  will  not  approach 
50%  duty  cycles  from  full  off  to  full  on  in  actual  operation,  this  still  gives  us  a  very  tight 
energy  dissipation  budget,  and  may  eventually  force  a  lowering  of  the  design  bandwidth. 
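The power estimate above follows from multiplying the number of neuron units, the switching rate, the duty cycle, and the switching energy per unit. The following Python sketch reproduces that arithmetic under the simple assumption that dissipation scales linearly with each factor; the function names are hypothetical.

```python
def array_power_watts(
    n_units: int, bandwidth_hz: float, duty_cycle: float, switching_energy_j: float
) -> float:
    """Average power dissipated by an array of identically switching neuron units."""
    return n_units * bandwidth_hz * duty_cycle * switching_energy_j


def max_bandwidth_hz(
    n_units: int, power_budget_w: float, duty_cycle: float, switching_energy_j: float
) -> float:
    """Highest switching rate that keeps the array within a given power budget."""
    return power_budget_w / (n_units * duty_cycle * switching_energy_j)


if __name__ == "__main__":
    power = array_power_watts(100 * 100, 100e6, 0.5, 2e-12)
    print(f"100 x 100 array at 100 MHz, 2 pJ per switch: {power:.2f} W")
```

Inverting the same relation, as in the second function, shows how a fixed power budget translates directly into a ceiling on the design bandwidth.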

For  photonic  implementations  of  neural  networks  that  use  optical  imaging  and  holo¬ 
graphic  interconnection  systems  extensively  to  increase  the  computational  throughput  ca¬ 
pacity,  an  important  question  will  continue  to  be  the  fraction  of  “unfilled”  system  volume 
dedicated  to  wavefront  and  beam  propagation.  Miniaturization  of  many  sophisticated  op¬ 
tical  signal  processing  systems  has  been  a  focus  of  effort  only  recently,  and  can  be  expected 
to  produce  significant  system  volume  reductions  through  clever  (as  well  as  careful)  opto¬ 
mechanical  engineering.  For  example,  gradient  index  (GRIN)  techniques  have  been  used  in 
conjunction  with  photolithographic  planar  processing  to  produce  regular  two-dimensional 
arrays  of  diffraction  limited  microlenses  that  can  be  incorporated  in  stacked  planar  optical 
modules  with  greatly  reduced  unfilled  system  volume  [Iga,  1984].  There  are,  however,  sev¬ 
eral  inherent  limits  (both  fundamental  as  well  as  technological)  that  provide  lower  bounds 
on  the  physical  volume  required  to  implement  a  high  degree  of  interconnectivity  among 
planes  of  neuron  units. 

When  all  is  said  and  done,  it  is  certainly  a  fair  question  to  ask  whether  the  physical
volume  of  a  neuro-optical  computer  module  would  be  better  off  densely  packed  with  silicon
chips  that  emulate  the  same  functionality.  All  of  the  preliminary  evidence  gathered  to
date  suggests  that  the  answer  to  this  question  shifts  rather  dramatically  from  an  emphatic 
“yes”  in  the  limit  of  small  numbers  of  neurons  and  required  interconnections,  to  a  more 
tenuous  “no”  as  the  number  of  neurons  and  density  of  required  interconnections  continues 
to  increase.  Perhaps  the  most  interesting  question  for  the  future  of  neuro-optical  compu¬ 
tation  is  the  clear  identification  of  this  performance  boundary. 


The  Incorporation  of  Neural  Paradigms 

In  concluding  this  chapter  on  neuro-optical  computation,  we  assert  that  although  the 
majority  of  preliminary  demonstrations  of  photonic  neural  network  architectures  have 
seemingly  focused  on  associative  memories  in  general,  and  on  variations  of  the  Hopfield- 
Amari  [Hopfield,  1982;  Amari,  1972]  network  in  particular,  it  is  essential  that  photonic 
implementations  have  the  capacity  for  incorporation  of  a  wide  variety  of  neural  network 
architectures,  computational  algorithms,  and  learning  rules.  This  is  particularly  important 
in  view  of  the  early  stage  of  development  that  characterizes  our  current  understanding  of 
the  operational  performance  of  even  the  most  fashionable  neural  network  models,  when
scaled  up  to  large  numbers  of  densely  interconnected  neuron  units  with  realistic  stochastic 
variations  in  individual  neuron  unit  performance. 

Furthermore,  it  is  likely  that  useful  neural  networks  incorporated  in  a  systems  frame¬ 
work  may  require  either  a  number  of  layers  with  different  characteristics,  or  considerable 
pre-  and  post-processing  to  achieve  sophisticated  functionality.  One  example  of  a  neural
paradigm  that  requires  such  additional  sophistication  is  the  Dynamic  Link  Architecture  of
von  der  Malsburg  discussed  in  Chapter  11  [Buhmann,  1991]  as  applied  to  pattern  recognition
problems  by  means  of  graph  matching  techniques.  A  photonic  implementation  of  this
architecture  will  likely  require  several  interacting  modules  for  complete  functionality. 

Currently  investigated  photonic  architectures  and  components  for  neural  network  implementation
do  not  yet  enjoy  the  flexibility  of  full-fledged  computer  aided  design  and
computer  aided  manufacturing  that  is  the  hallmark  of  the  silicon  VLSI  circuit  industry.
On  the  other  hand,  tremendous  strides  have  been  made  in  just  the  past  few  years  in  the 
simulation  of  even  complex  optical  systems  including  the  effects  of  both  refractive  and 
diffractive  components.  The  next  step,  from  extensive  simulation  capabilities  to  design 
automation,  is  under  active  investigation  and  may  when  taken  herald  the  beginnings  of  a 
viable  photonic-based  neural  network  implementation  technology.


PROBLEMS

1.  Consider  a  system  of  beamsplitters  arranged  to  combine  a  set  of  N  input  beams  to 
form  a  single,  collinear  output  beam  with  an  intensity  proportional  to  an  equally 
weighted  sum  of  all  of  the  inputs.  Choose  a  particular  architecture  for  the  beam¬ 
splitter  arrangement,  and  justify  it  in  terms  of  efficiency,  simplicity,  or  minimization 
of  component  count.  For  the  case  of  incoherent  illumination,  derive  the  optimal 
transmissivities  of  the  beamsplitters  in  your  arrangement,  and  prove  that  the  chosen 
architecture  generates  an  input-output  relationship  in  the  form  of  Equation  (2).  Re¬ 
peat  the  analysis  for  the  case  of  coherent  illumination,  and  derive  the  equivalent  of 
Equation  (8)  that  characterizes  the  chosen  architecture.

2. Two mutually coherent beams of intensities $|a|^2$ and $|b|^2$ are incident on a detector.
The coherent superposition of the beams is given by, in one dimension,

$A(x) = a\,e^{-i k_1 x} + b\,e^{-i k_2 x}$

(a) Plot the resulting intensity, $|A(x)|^2$, as a function of x.

(b) Show that the integral of $|A(x)|^2$ over an integral number of its periods is equal
to $|a|^2 + |b|^2$, thus proving that the detector's response is equal to the incoherent
sum of the individual beams.

3. For the case of a thin phase grating with a sinusoidal index modulation given by
$n(x) = n_0 + n_1 \sin(k_a x)$ with $n_1 < n_0$, calculate the value of $n_1 d$ that maximizes
the diffraction efficiency into the first diffracted order, and the maximum diffraction
efficiency achievable. Assume that the grating is read out by a semiconductor laser
with a wavelength of 850 nm. For a given incident intensity of the optical readout
beam, calculate the ratio of the intensity diffracted into the 0th, 2nd, and 3rd diffracted
orders to that diffracted into the 1st order.

4. Consider the process of diffraction from a thin amplitude grating with a spatially
varying transmissivity and negligible phase modulation. For a sinusoidal transmittance
modulation given by $t(x) = t_0 + t_1 \sin(k_a x)$ with $t_1 < t_0$, calculate the value
of $t_1$ that maximizes the diffraction efficiency into the first diffracted order, and the
maximum diffraction efficiency achievable. Assume that the grating is read out by
a semiconductor laser with a wavelength of 850 nm. For a given incident intensity
of the optical readout beam, calculate the ratio of the intensity diffracted into the
0th, 2nd, and 3rd diffracted orders to that diffracted into the 1st order. Discuss the
essential differences observed between the amplitude and phase grating cases.

5.  Consider  the  process  of  holographic  grating  recording  in  a  thick  holographic  recording 
medium.  Assume  that  the  grating  is  recorded  and  read  out  by  a  semiconductor  laser 
with  a  wavelength  of  850  nm,  and  that  the  angle  included  between  the  two  recording 
beams  is  30  degrees.  What  is  the  spatial  frequency  of  the  recorded  grating?  How 
thick  must  the  hologram  be  in  order  to  generate  a  grating  parameter  Q  (as  defined  in 
the  text)  of  1,000?  What  is  the  approximate  angular  width  of  this  recorded  grating 
(as  measured,  for  example,  by  varying  the  angle  of  incidence  of  the  readout  beam)? 

6. For the thick holographic grating described in Problem 5, calculate the amplitude
of the refractive index modulation that is necessary to achieve 100% diffraction efficiency
on readout. How large an absorption coefficient can be tolerated in the
holographic recording medium at the readout wavelength if the absorption loss in
diffraction efficiency is to be kept below 5%?

7. Consider the holographic interconnection scheme depicted in Figure 15.9. First,
derive the basic relationship for a lens that associates a given point $p_1$ in the input
plane with a resulting beam angle $\theta_1$. Given a focal length of 5 centimeters for lenses
$L_1$ and $L_2$, what is the minimum spacing required between nearest neighbor points in
the  input  plane  for  a  grating  Q  of  1,000?  What  Q  will  be  required  to  accommodate 
of  order  10^  input  positions? 

8.  Design  a  differentiating  circuit  for  incorporation  in  an  optically  addressed  spatial  light 
modulator,  using  the  principles  of  Chapters  14  and  15.  Assume  that  the  detector 
is  a  p-i-n  photodiode,  and  that  the  modulator  can  be  treated  as  a  purely  capacitive 
load. If the modulator can be modeled as a parallel plate capacitor of dimensions
30 x 50 μm, with a thickness of 1 μm and a relative dielectric constant typical of
gallium arsenide multiple quantum well devices (ε = 13), estimate the bandwidth
over  which  the  differentiator  is  operational. 

9. If storage of one synaptic weight in VLSI requires a memory element 15 μm x 15 μm
in size, what is the maximum number of synaptic weights that can be implemented
on a 1 cm x 1 cm chip? If a VLSI neural network is fully connected using one such
memory element for each synapse, how many pins are required for input to, and
output from, the set of neuron units? How many pins would be required for parallel
input of the weights? If an optical synaptic weight can be implemented in an effective
volume of 5 μm x 5 μm x 5 μm, what is the maximum number of synaptic weights
that can be implemented in a volume 1 cm x 1 cm x 1 cm?

10. (a) If a neural network is simulated on a digital, sequential machine, how many multiply
operations and add operations are required to simulate the computational
process of a single-layer feedforward network with N neuron units and a connectivity
(number of connections per neuron unit) of M? If a multiply operation
can be performed in 100 ns and an add operation in 25 ns, what is the minimum
time it will take for one pass through the network if N = 10® and M = 10^?

(b) For the outer product learning of Equation (24), neglecting the decay term,
how many multiply operations and add operations are required for one iteration
of weight updates, in terms of M and N? For a two-layer network (1 hidden
layer), with each layer having N = 10® and M = lO"*, and assuming 10“* different
patterns presented 1,000 times each, how long will the network take to be trained
(assuming 1 forward pass and 1 update of all weights per presentation, 100 ns
per multiply operation and 25 ns per add operation)?

(c) For the same numbers of (b), assuming each layer of a photonic system can
perform a forward pass in 100 ns and a set of parallel weight updates in 1 μs,
how long will it take to be trained?

11. (a) Design an algorithm that uses only nonnegative signals outside of each neuron
unit, nonnegative weights, and allows two separate inputs to, and two separate
outputs from, each neuron unit. It should be able to perform neural computation
and weight updates for learning. Your answer should be in the form of a flow
chart. You may perform subtraction and division only within each neuron unit;
only addition, multiplication, interconnection and storage can be performed
external to the neuron units. (No need to simulate.)

(b)  After  many  iterations  during  learning,  might  there  be  a  problem  with  weights 
saturating or going out of bounds? If not, why not? If so, conjecture as to how
this  problem  might  be  avoided. 

12. In regard to Equation (24), find an expression for $\delta_i$ for the following algorithms:

(a)  Perceptron 

(b)  Widrow-Hoff 

(c)  Least  minimum  square  (back  propagation),  for  a  multi-layer  net  for 

(i)  output  layer 

(ii)  hidden  layers 

Assume that $\beta = 0$ for this problem.

(Note:  This  problem  requires  familiarity  with  neural  networks  not  discussed  in  this 
chapter.) 

13. Referring to the system of Figure 15.21, if the interconnection mask SLM were electronically
addressed with a serial line, capable of transmitting analog values at 50
MHz,  and  there  are  10®  pixels  (analog  weights),  what  is  the  maximum  frame  rate? 
If  there  are  10^  parallel  lines  addressing?  If  the  SLM  is  optically  addressed,  what 
limits  the  frame  rate? 

14. (a) Referring to Figure 15.23, show how the following network can be drawn as a
single layer network with feedback.

<<  Insert  Figure  for  Problem  14(a)  >> 

(b)  How  can  the  architecture  shown  in  Figure  15.26  be  modified  to  include  feedback 
connections  within  the  module?  Sketch  the  resulting  architecture. 

15.  Consider  a  charge-coupled-device  array  as  shown  schematically  in  Figure  15.34,  of 
dimension  1000  x  1000  pixels  with  a  serial  readout  buffer  that  operates  at  a  clock 
frequency  of  100  MHz.  Calculate  the  maximum  frame  rate  achievable,  and  the  speed 
required  of  the  row  shift  registers.  Calculate  the  ratio  between  the  total  storage  time 
required  of  the  first  pixel  read  out  to  that  of  the  last  in  each  frame. 


FIGURE  CAPTIONS 


Fig.  15.1  Illustration  of  optical  addition  utilizing  a  50/50  beamsplitter:  (a)  collinear 
incoherent  beam  geometry;  (b)  collinear  coherent  beam  geometry,  showing  input  and 
output  amplitudes;  (c)  collinear  coherent  beam  geometry,  showing  input  and  output 
intensities. 

Fig. 15.2 Illustration of optical addition utilizing mirrors: (a) angularly multiplexed incoherent
beam geometry; (b) angularly multiplexed coherent beam geometry.

Fig. 15.3 Illustration of optical multiplication utilizing a medium with variable transparency.

Fig. 15.4 Fundamental principles of spatial light modulator function: (a) block diagram
of the principal functions of an optically-addressed spatial light modulator, including
the detection, functional implementation, and modulation functions; (b) schematic
diagram of an N x N array of spatial light modulator pixels, in which three pixels
are shown in different transmission states; (c) expanded view of the pixel array,
showing an incomplete fill factor within each pixel; (d) expanded view of a single
pixel within the array, illustrating one possible pixel configuration that incorporates
two detector elements $D_1$ and $D_2$, control electronics for impedance matching and
functional implementation, and two modulator elements, shown here in different
transmittance states.

Fig. 15.5 Examples of monolithically-integrated spatial light modulators. The chosen
examples incorporate photodetectors, control circuitry, and multiple quantum well
modulators within each pixel on a single gallium arsenide (GaAs) substrate. In
(a), the control electronics and photodetector elements are fabricated following the
photolithographic definition and physical isolation of the modulator elements, while
in (b) a buffer (isolation) layer is used to allow fabrication and interconnection of all
of the elements without chemical or ion beam etching.

Fig.  15.6  Example  of  a  hybrid  spatial  light  modulator,  in  which  the  photodetectors  and 
control  electronics  are  fabricated  on  a  silicon  substrate,  and  the  multiple  quantum 
well modulator elements are fabricated on a gallium arsenide (GaAs) substrate. The
two  sets  of  devices  are  bump  contacted  on  a  pixel-by-pixel  basis  to  provide  parallel 
electrical  continuity. 

Fig.  15.7  VLSI  layout  of  a  generalizable  silicon-based  spatial  light  modulator  structure: 
(a) neuron pixel layout; (b) photograph of a single neuron unit in VLSI implementation,
with probe pads substituted for the two detectors (bottom) and for contact to
the  two  modulation  elements  (top);  (c)  photograph  of  a  6  x  6  array  of  neuron  units 
on  a  VLSI  chip  that  incorporates  additional  test  circuitry. 

Fig. 15.8 A simplified holographic recording configuration: case of plane wave signal and
reference beams, and a thin holographic recording medium; (a) recording, and (b)
reconstruction with a plane wave readout beam.

Fig. 15.9 A point-to-point interconnection system, using a holographic optical element
(HOE) for interconnection routing, and lenses as position-to-angle and angle-to-position
encoders. In this example, the holographic optical element effectively performs
an input angle to output angle transformation, such that light emitted (or
transmitted) at point $p_1$ in the input plane ($P_1$) is detected at point $p_2$ in the output
plane ($P_2$).

Fig.  15.10  Volume  holographic  recording  with  plane  wave  signal  and  reference  beams;  (a) 
recording,  and  (b)  reconstruction,  showing  the  elimination  of  the  higher  diffracted 
orders. 

Fig.  15.11  The  angular  alignment  sensitivity  of  a  volume  holographic  optical  element, 
as a function of the dimensionless Q-parameter defined in the text. The grating
strength  for  all  of  the  curves  (3.14  radians)  is  optimized  to  produce  100%  diffraction 
efficiency  in  the  limit  of  large  Q  (Bragg  diffraction  regime),  and  is  not  optimized 
for  low  Q  gratings.  Note  that  the  diffraction  efficiency  is  essentially  independent  of 
angle  for  low  Q  gratings,  and  is  very  strongly  peaked  at  the  Bragg  angle  (7.5  degrees 
in  this  case)  for  high  Q  gratings. 

Fig.  15.12  The  diffraction  efficiency  of  thin  (Raman-Nath  diffraction  regime)  and  thick 
(Bragg  diffraction  regime)  holographic  gratings  as  a  function  of  the  grating  strength. 

Fig. 15.13 Schematic representation of a 4 input, 4 output holographic interconnection,
showing 4 coherent input beams $x_1$-$x_4$ and 4 coherent recording beams $y_1$-$y_4$, each
of which corresponds to a desired output $y'_1$-$y'_4$. In (a), the sets $\{x_j\}$ and $\{y_i\}$ interfere
within the volume holographic medium, recording the desired interconnection
diffraction gratings. In (b), a new set of input beams $\{x_j\}$ illuminates the volume
holographic medium, reading out the weighted interconnection pattern and forming
appropriately weighted sums at each of the outputs $\{y'_i\}$.

Fig. 15.14 Schematic representation of the fan-out process for optical beams, for the case
of one input and three outputs: (a) with beamsplitters ($BS_1$-$BS_3$); (b) with a single
holographic optical element containing three multiplexed (spatially superimposed)
diffraction gratings.

Fig.  15.15  Schematic  representation  of  the  fan-in  process  for  optical  beams,  for  the  case 
of three angularly distinct inputs and one combined collinear output beam: (a) with
beamsplitters,  showing  the  unavoidability  of  a  throughput  loss  associated  with  the  set 
of  transmitted  (and  multiply  reflected)  beams;  (b)  with  a  single  holographic  optical 
element  containing  three  multiplexed  (spatially  superimposed)  diffraction  gratings, 
showing  an  analogous  throughput  loss. 

Fig. 15.16 Illustration of the generation of crosstalk in holographic optical interconnections
due to beam degeneracy: recording/readout configuration. The input beams
$\{x_j\}$ are assumed to have interfered within the volume holographic medium with the
set of recording beams $\{y_i\}$, producing the desired set of interconnection gratings
with weights $W_{ij}$. Illumination of the volume holographic medium with beam $x_1$ produces
a 1 to 4 fanout into the output beams as well as the zeroth order beam
$x'_1$. Due to the effects of beam degeneracy, power is also coupled into the zeroth
order beams and crosstalk terms $\{c_i\}$ are introduced into the outputs.

Fig. 15.17 Illustration of the generation of crosstalk in holographic optical interconnections
due to beam degeneracy: diffraction efficiency as a function of grating strength
for the readout configuration of Figure 15.16. Shown are the depletion of the zero
order beam $x_1$ and the rise of the desired output beams $y'_i$, accompanied by a strong
buildup of the cross-coupled beams.

Fig.  15.18  Illustration  of  a  surface-emitting  laser  diode  source  array  [after  Jewell,  1990]. 
In this example, the individual semiconductor laser diodes are isolated by chemically
assisted ion beam etching techniques, must be individually contacted, and emit
through  the  GaAs  substrate. 

Fig. 15.19 Schematic diagram of a photodarlington pair utilized as a high gain detector/amplifier
combination.

Fig. 15.20 Schematic diagram of a charge coupled device (CCD) photodetector array
fabricated on a silicon substrate. Electrostatic potential wells are created by application
of appropriate voltages to the three phase bias electrode structure, with electrical
isolation provided by the gate oxide layer. Light incident through the transparent
electrodes creates stored charge that can be transferred to an output signal terminal
by proper sequential phasing of the bias voltages ($P_1$-$P_3$).

Fig. 15.21 Block diagram of a 1-D to 1-D photonic neural network, in which a one-dimensional
neuron array is fully interconnected to a one-dimensional detector array
by means of a two-dimensional interconnection mask.

Fig. 15.22 Block diagram of a 2-D to 2-D photonic neural network, in which a two-dimensional
neuron array is fully interconnected to a two-dimensional output array
by means of a three-dimensional volume holographic optical interconnection mask.
The  input  plane,  output  plane,  and  optional  training  plane  are  shown.  Many  variants 
of  this  geometry  with  similar  properties  are  possible. 

Fig. 15.23 A single layer physical neural network with feedback, used to implement a
multilayer recurrent functional network. The solid boxes indicate feedforward connections,
and the broken boxes indicate lateral connections.

Fig. 15.24 Incoherent/coherent technique for recording and reconstructing multiple holograms
simultaneously, in which all solid lines represent mutually coherent beams, and
all broken lines represent a separate set of mutually coherent beams: (a) recording;
(b) reconstruction; and (c) holographic representation, in which each hologram represents
the fanout from a given neuron unit.

Fig. 15.25 Doubly angularly multiplexed volume holographic optical interconnection, designed
to circumvent the effects of beam degeneracy. The mutually incoherent input
beams ($\{x_j\}$) are angularly multiplexed over j, as are the corresponding sets of output
beams from the training plane ({^P}) generated by the coherent sources $S_j$, to
produce an angularly multiplexed fan-in at each summed output, thus yielding the
neuron activation potentials $\{p_i\}$.

Fig. 15.26 Photonic architecture for neural network implementation that incorporates a
parallel source array, double angular multiplexing, and incoherent/coherent recording
and reconstruction; the Hebbian case is depicted.

Fig. 15.27 Photonic architecture for neural network implementation: recording configuration.
This configuration implements the learning function in the photonic architecture
of Figure 15.26. The sets of beams emitted from the source array (two are
shown) interfere in the volume holographic medium to update the weights stored in
the interconnection holograms.

Fig. 15.28 Photonic architecture for neural network implementation: reconstruction configuration.
This configuration implements a single forward pass of the computing
function in the photonic architecture of Figure 15.26. The lower set of beams acts
as a set of reference beams, and generates a set of weighted output arrays that are
imaged onto the detector array. Each stored hologram is reconstructed by a single
neuron unit $x_j$, and fans out with appropriate weights to illuminate the detector
array. The full set of reconstructed holograms sums within each pixel to yield the
neuron activation potentials $\{p_i\}$.

Fig. 15.29 Generalized photonic architecture for neural network implementation, including
provision for the generation of arbitrary training terms ($\delta_i$).

Fig. 15.30 Schematic diagram of a dual-input, dual-output differential amplifier that
effects a sigmoid-like transfer characteristic.

Fig. 15.31 Experimentally obtained transfer characteristics from the circuit shown in Figure
15.30, showing the output voltage in both channels ($V_{out1}$ and $V_{out2}$) as a function
of one input voltage ($V_{in1}$), with the other input voltage ($V_{in2}$) as a parameter.


Fig. 15.32 Optical layout for copying the entire contents of a three-dimensional volume
holographic optical element (VHOE) into a second VHOE, utilizing a two-dimensional
array of individually coherent, but mutually incoherent sources.

Fig.  15.33  The  single  pixel  probability  of  error  P(Err)  as  a  function  of  the  number  of 
photons  detected  within  each  pixel,  for  the  cases  of  100  and  1000  analog  grey  levels. 

Fig.  15.34  Illustration  of  parallel-to-serial  conversion  in  two-dimensional  detector  arrays 
such  as  the  charge-coupled-device  (CCD)  array.  Charge  accumulated  within  each 
photosensitive  region  during  exposure  is  transferred  laterally  by  a  set  of  row-parallel 
shift  registers  to  a  high  speed  parallel-to-serial  shift  register,  which  reads  out  the 
entire  array  one  column  at  a  time. 

Numerous applications have been envisioned for spatial light modulators in optical
information processing and computing systems [1, 2]. These applications can be summarized
within the context of a generalized optical information processor or computer as
shown schematically in Fig. 1 [2]. The principal functional roles of both one- and two-dimensional
spatial light modulators include those of format, input, output, CPU, and
memory devices. In addition, spatial light modulators can be effectively utilized to provide
certain types of feedback interconnections, as for example in the case of optical crossbar
switches and holographically encoded weighted interconnections.

Such a wide variety of applications has of course led to an equally wide variety of interrelated,
and at times conflicting, device requirements. In this presentation, we examine
these requirements from three complementary perspectives: fundamental physical limitations
that affect the performance of any spatial light modulation function; the current
status of spatial light modulator development with respect to such fundamental limits;
and technological considerations that impact present and future device design and development.
Each of these three perspectives will be discussed in detail, and is outlined briefly
below.

Study  of  the  fundamental  physical  limitations  that  affect  an  emerging  technology  is  at 
once  an  exciting  and  somewhat  sobering  endeavor.  The  excitement  arises  naturally  from 
the discovery of what we can in fact achieve; the sobering impact often occurs with the realization
of what we have in fact achieved. Numerous such fundamental physical limitations
pertain to the process of spatial light modulation. For example, two principal attributes
of incident wavefronts can be conveniently modulated: amplitude and/or phase. In most
applications,  it  is  desirable  to  modulate  one  or  the  other,  but  not  both.  Yet  these  two 
parameters  are  intimately  related,  such  that  modulation  of  one  has  a  deterministic  impact 
on  the  other  through  the  Kramers-Kronig  relations.  Analysis  of  this  interrelationship  can 
yield  fundamental  limits  on  the  phase/amplitude  cross-talk  anticipated  for  various  physical 
device  configurations,  as  well  as  appropriate  figures  of  merit.  Furthermore,  such  analysis 
reveals  the  potential  for  “dispersion  engineering”,  in  which  one  can  imagine  utilizing  a 
technology  such  as  compound  semiconductor  multiple  quantum  wells  to  tailor  dispersion 
and/or  absorption  curves  to  match  device  requirements.  One  such  example  is  illustrated  in 
Fig.  2,  which  shows  the  related  absorption  and  dispersion  curves  for  two  5  meV  linewidth 
Lorentzian  oscillators  displaced  by  15  meV.  As  can  be  seen,  a  region  of  nearly  linear  index 
variation  with  energy  can  be  generated  at  a  local  minimum  in  the  absorption  profile. 
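
The two-oscillator example can be explored numerically. The following is a minimal sketch, not the calculation used to produce the figure: it assumes that the quoted 5 meV linewidth is a full width at half maximum, that the two oscillators have equal strength, and that a single-pole (near-resonance) Lorentzian line shape is adequate. All names are hypothetical, energies are measured in meV from the midpoint between the two lines, and the results are in arbitrary units.

```python
# Absorption and dispersion of two Lorentzian oscillators (arbitrary units).
# Assumptions (not from the text): 5 meV is the FWHM, equal oscillator
# strengths, single-pole line shape, energies in meV from the midpoint.
LINEWIDTH_FWHM_MEV = 5.0
SEPARATION_MEV = 15.0
HALF_WIDTH = LINEWIDTH_FWHM_MEV / 2.0
CENTERS = (-SEPARATION_MEV / 2.0, SEPARATION_MEV / 2.0)


def response(energy_mev):
    """Complex response: sum of 1 / ((E0 - E) - i*HALF_WIDTH) over both lines."""
    return sum(1.0 / complex(center - energy_mev, -HALF_WIDTH) for center in CENTERS)


def absorption(energy_mev):
    """Imaginary part of the response (proportional to absorption)."""
    return response(energy_mev).imag


def index_change(energy_mev):
    """Real part of the response (proportional to the index change)."""
    return response(energy_mev).real


if __name__ == "__main__":
    # Tabulate both curves in the window between the two resonances.
    for e in range(-3, 4):
        print(f"E = {e:+d} meV  absorption = {absorption(e):.4f}  "
              f"index change = {index_change(e):+.4f}")
```

Under these assumptions the absorption has a local minimum at the midpoint, while the index change passes through zero there and is antisymmetric about it, so its curvature vanishes and it varies almost linearly with energy over the central few meV.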

A  second  important  fundamental  limitation  pertains  to  the  minimum  (quantum  limited) 
energy  required  per  unit  resolution  element  to  achieve  a  given  level  of  modulation  within 
predetermined  accuracy  constraints.  Different  limits  can  be  derived  for  both  analog  and 
digital  spatial  light  modulation  [2,  3,  4,  5],  as  well  as  for  the  important  cases  of  optically 
addressed  and  electrically  addressed  devices.  For  purposes  of  discussion,  consider  the 
case  of  optically  addressed  spatial  light  modulators,  for  which  the  modulation  function 
inherently  involves  a  detection  process  within  each  resolution  element.  The  quantum  limits 
for binary switching are well known, and are summarized for optical devices in Fig. 3 (after
P. W. Smith, Ref. [6]). For analog modulation, quantum restrictions place much stricter
boundary  limitations  on  the  minimum  allowable  photon  flux  for  a  given  pixel  resolution, 
error  rate,  and  device  framing  rate  [2,  3,  4,  5]. 

By comparison, analog representations (as used extensively, for example, in optical
processors) require far more energy than the binary equivalent. This is due to the necessity
of utilizing a much higher particle count (electrons or photons) in order to minimize
the effects of quantum statistical fluctuations on the bit error rate (BER). For example,
if we wish an analog representation of the number 1,000, then we require a dynamic
range of at least 1,000:1. For incoherent illumination, quantum fluctuations in the emission/detection
process produce a photon number distribution with a relative standard
deviation of $\sigma \approx \sqrt{N}/N$. The equivalent of a $10^{-9}$ BER for the digital case corresponds
to roughly 12 standard deviations. Therefore, the number of photons required must be
greater than $1.5 \times 10^{8}$ from statistical considerations alone, as illustrated in Fig. 4. For a
GaAs semiconductor laser characterized by a photon energy of ~1.5 eV, this corresponds
to about $10^{10}$ $k_B T$. To represent 1,000 optically in binary requires approximately 14 bits
(10 bits for the number plus 4 bits of overhead) at 15 eV each (10 photons at 1.5 eV each,
assuming direct detection and an ideal detector), or about $10^{4}$ $k_B T$.
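
The photon-budget arithmetic in this comparison can be checked with a short calculation. The sketch below rests on stated assumptions rather than on the authors' own derivation: the analog requirement is modeled by demanding that one grey-level step at full scale, N/levels, span 12 standard deviations of a Poisson-distributed count (standard deviation sqrt(N)), and the thermal energy is evaluated at an assumed temperature of 300 K, which the text does not state. All names are hypothetical.

```python
# Quantum-limited energy cost: analog versus binary representation of 1,000.
K_B_EV_PER_K = 8.617333e-5   # Boltzmann constant in eV/K
TEMPERATURE_K = 300.0        # assumed room temperature (not given in the text)
PHOTON_ENERGY_EV = 1.5       # GaAs laser photon energy quoted in the text


def min_photons_analog(levels, n_sigma):
    """Smallest N with N/levels >= n_sigma*sqrt(N), i.e. N >= (levels*n_sigma)**2."""
    return (levels * n_sigma) ** 2


def energy_in_kt(energy_ev):
    """Express an energy in units of k_B*T at the assumed temperature."""
    return energy_ev / (K_B_EV_PER_K * TEMPERATURE_K)


# Analog: 1,000 levels, 12 standard deviations per level step.
analog_photons = min_photons_analog(1000, 12)
analog_kt = energy_in_kt(analog_photons * PHOTON_ENERGY_EV)

# Binary: 14 bits, 10 photons per bit.
binary_ev = 14 * 10 * PHOTON_ENERGY_EV
binary_kt = energy_in_kt(binary_ev)

print(f"analog: {analog_photons:.2e} photons, {analog_kt:.1e} k_B T")
print(f"binary: {binary_ev:.0f} eV, {binary_kt:.1e} k_B T")
```

With these assumptions the analog case requires about 1.4 x 10^8 photons, or roughly 10^10 k_B T, while the binary case requires 210 eV, or roughly 10^4 k_B T, a difference of about six orders of magnitude.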

For an optically addressed spatial light modulator operating over a dynamic range of
1000:1 with 1000 x 1000 elements in an active area of 1 cm$^2$, and at a frame rate of
1 MHz, the detected input flux must exceed 9 W/cm$^2$ in order to achieve a $10^{-9}$ BER
in each pixel. Considerations such as these have significant implications for the SLM
design, particularly with regard to sensitivity at a given dynamic range. These implications
extend to the system design as well, in which case it is important to distinguish between
single and multiple iteration algorithms (with different BER requirements at each stage).
Furthermore,  in  many  applications  the  algorithm  is  really  a  form  of  contraction  mapping, 
in  which  the  desired  output  space  may  be  a  very  small  subset  of  the  transformed  input 
space  (e.g.  determining  whether  or  not  a  given  correlation  peak  exceeds  a  predetermined 
threshold).  In  such  cases,  significantly  relaxed  demands  may  be  appropriate  for  the  BER 
of  individual  SLM  pixels. 

For  envisioned  systems  applications  involving  one  or  more  spatial  light  modulators, 
analysis of fundamental limitations such as that described above is essential for determining
whether an analog or digital approach is favorable from the point of view of a fixed
input  power  budget  at  a  given  desired  computational  throughput  rate.  In  general,  a  given 
processing  or  computation  function  can  be  partitioned  into  the  cost  (energy  or  otherwise) 
of  representation,  the  cost  of  computation,  and  the  cost  of  detection  and  utilization  of 
the  answer.  For  operation  at  the  quantum  limits,  analog  representations  are  favored  for 
architectures  and  algorithms  that  implement  a  high  degree  of  computational  complexity 
(irreducible  number  of  equivalent  binary  operations)  per  unit  detected  output  resolution 
element,  whereas  binary  representations  favor  operations  with  a  somewhat  lower  degree  of 
computational  complexity. 

The current status of spatial light modulator development depends strongly for its assessment
on the nature of the application and its resultant requirements. It is interesting
to note at the outset that for certain applications (such as incoherent-to-coherent conversion
with high analog accuracy), spatial light modulator technologies have been developed
which approach quantum limited performance. On the other hand, a broad spectrum of
applications exists for which current spatial light modulators fall far short of such ultimate
performance boundaries. In making such comparisons, it is of critical importance to identify
interrelated sets of performance parameters that cannot be arbitrarily separated, and
to  assess  the  conjoint  figure  of  merit  achievable  within  a  given  device  technology  (rather 
than  the  minimum  value  obtained  across  many  different  types  of  devices,  or  even  within 
the  same  device  under  different  operating  conditions). 

A large number of technological considerations apply to the eventual incorporation
of spatial light modulators in optical information processing and computing systems. For
systems of given size, spatial frequencies are inherently limited by the acceptance apertures
of finite F-number lenses. Phase modulation, which is capable of much larger diffraction
efficiencies at a given spatial frequency than amplitude modulation, is more difficult to
implement in imaging configurations and is more sensitive to substrate nonuniformities
and polish figure. Amplitude modulation, on the other hand, can present formidable
thermal dissipation problems for applications involving spatial light modulation with high
optical throughput gain. Applications exist for both optically addressed and electrically
addressed spatial light modulators, with distinct requirements for each. In fact, several
recently conceived applications such as the utilization of spatial light modulators in neural
network implementations could advantageously employ both address modes simultaneously.

Electrically addressed spatial light modulators lead quite naturally to inherently
pixelated structures. Such structures can be advantageous from several points of view, including
the convenient merging of optical and electrical inputs, interpixel cross-talk, strictly
limited space-bandwidth product, and fixed system registration. However, pixelation can yield
additional difficulties such as inherently incomplete fill factors, scattering from metallic
interconnections, and fixed pattern noise. Similar types of considerations apply to the
utilization of reflective as opposed to transmissive device geometries.

A final technological consideration that will continue to strongly affect device design
and development is that of available optical sources, both CW and pulsed. For fixed (and
usually limited) external power, size, and weight considerations, the maximum achievable
pulsed energy densities set stringent requirements on the magnitudes of usable
higher order material nonlinearities. Recent achievements of enhanced nonlinearities in
both bulk and multiple quantum well compound semiconductor structures, as well as in
nonlinear organic polymers, lend importance to the question of whether or not χ(3)
materials as well as χ(2) materials will provide useful spatial light modulation functions in future
devices.

Fig. 1 Schematic diagram of the principal elements of a generalized optical processor or
computer.

Fig. 2 Absorption and dispersion of a two oscillator system, with 5 meV linewidths
displaced by 15 meV.

Fig. 3 Power dissipation per bit as a function of switching time, showing several
fundamental and technological limitations (after Ref. 6).

Fig. 4 The probability of error P(ERR) as a function of the number of photons detected
in a given pixel of an optically addressed spatial light modulator, for both 100:1 and 1000:1
dynamic range.

Fundamental and Technological Limitations of Asymmetric Cavity
MQW InGaAs/GaAs Spatial Light Modulators


C. Kyriakakis, Z. Karim, J. H. Rilum, J. J. Jung,
A. R. Tanguay, Jr., and A. Madhukar


Optical Materials and Devices Laboratory,
and Center for Photonic Technology
University of Southern California,
Los Angeles, CA 90089-0483

Summary 


Numerous applications have been envisioned for spatial light modulators in optical
information processing and computing systems. The principal functional roles of both one- and
two-dimensional spatial light modulators include those of format, input, output, CPU, and
memory devices. Such a wide variety of applications has of course led to an equally wide variety
of interrelated, and at times conflicting, device requirements that arise from fundamental physical
limitations as well as current technological drawbacks. In this investigation we have examined
these requirements for a particular class of modulators, the operation of which is based on an
asymmetric Fabry-Perot etalon with a multiple quantum well (MQW) structure as the cavity
medium.

The use of such devices as reflection modulators based on AlGaAs/GaAs MQW structures
has been investigated by a number of authors (Refs. 1-3). One of the primary advantages of such
structures is the considerable increase in both dynamic range of modulation and contrast ratio
achievable at a given applied field strength in comparison with previously investigated transmissive
configurations, in addition to the elimination of the necessity for counteretching of the (absorptive)
GaAs substrate. On the other hand, the dynamic range achieved in any reflection configuration
must be assessed in conjunction with the 6 dB insertion loss characteristic of the intensity (rather
than polarization) beamsplitter required to implement normal incidence readout. This factor alone
rescales an 80% change in reflectivity to less than 20% when considered from the perspective of
system rather than device throughput. Furthermore, such a reflection configuration presents the
additional systems complexity of source isolation within each stage of a (multistage) processor.
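
The rescaling quoted above follows from the two passes that the readout beam makes through an intensity beamsplitter. A minimal sketch of that arithmetic, assuming an ideal lossless 50/50 splitter (the function names and the default split ratio are illustrative assumptions, not taken from the original work):

```python
import math

def double_pass_transmission(split_ratio=0.5):
    """Fraction of source power reaching the output after passing an ideal,
    lossless intensity beamsplitter twice (toward the modulator and back).
    split_ratio = 0.5 is an assumed 50/50 splitter."""
    return split_ratio * (1.0 - split_ratio)

def to_db(power_fraction):
    """Express a power fraction as a loss in dB (positive number)."""
    return -10.0 * math.log10(power_fraction)

def system_dynamic_range(device_dynamic_range, split_ratio=0.5):
    """Rescale a device-level reflectivity change to system throughput."""
    return device_dynamic_range * double_pass_transmission(split_ratio)

loss_db = to_db(double_pass_transmission())   # close to 6 dB
system_change = system_dynamic_range(0.80)    # 0.80 * 0.25
print(f"beamsplitter loss: {loss_db:.2f} dB, system-level change: {system_change:.0%}")
```

Any excess loss in a real beamsplitter lowers the system-level figure further, which is consistent with the "less than 20%" stated in the text.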

One possible means of avoiding the 6 dB insertion loss is to employ a polarizing
beamsplitter in conjunction with a quarter wave plate, thereby ensuring a 90° rotation of the
incident polarization on reflection. This configuration adds system complexity and cost, as well as
additional component requirements such as the extinction ratio of the polarizing beamsplitter, and
the off-axis uniformity of the net phase retardation induced by the quarter wave plate. Such
requirements must be set so as to not compromise the contrast ratio designed into the reflective
spatial light modulator. Alternatively, one can employ MQW structures specifically designed to be
operated at other than normal incidence, with the concomitant complications implied by the
associated anamorphic optics, lateral rescaling, depth of field, and placement of critical and
conjunctive components with appropriately low F-numbers.

In order to explore asymmetric cavity MQW structures that are capable of transmissive
mode operation without resort to the process difficulty and sample fragility implied by
counteretching, we have examined the prospects of utilizing electric-field-addressed multiple
quantum wells fabricated in the InGaAs/GaAs system on GaAs substrates. In this investigation,
we have taken the perspective that identification of fundamental physical limitations to device
performance is exceedingly useful as a mechanism for the delineation (and potential circumvention)
of limitations that arise from purely technological choices. These considerations are of particular
importance in the InGaAs/GaAs system due to the fact that the considerable lattice mismatch
characteristic of useful indium compositions results in a strained layer growth environment with
concomitant implications on growth morphology and the necessary incorporation of strain relief
mechanisms (such as growth on patterned substrates). In this paper, we describe several
important design criteria that have evolved from this study, and apply them to two specific cases:
(a) a transmission modulator with optimized dynamic range and contrast ratio, and (b) a reflection
configuration in which the readout illumination is incident through the (antireflection coated)
substrate. This latter configuration yields significantly improved performance characteristics as
well as novel operational features at a considerable reduction in device fabrication complexity.

An important fundamental physical limitation pertains to the degree of spatial light
modulation that can be achieved per unit change in a given modulation parameter. There are two
characteristics of wavefronts that can be conveniently modulated: amplitude and phase. In many
optical information processing applications it is desirable to modulate one or the other, but not
both. As an example of a "pure" amplitude modulator we have utilized a Lorentzian oscillator
profile to model a typical heavy hole exciton (neglecting the band edge absorption) for an
InGaAs/GaAs MQW structure with a peak absorption of 10,000 cm^-1 and a linewidth (full width at
half maximum) of 7.5 meV (derived from experimental data on a 27 period MQW structure with 35
monolayers of InGaAs (15% In) and 71 monolayers of GaAs grown by molecular beam epitaxy
(MBE)), as shown in Fig. 1. Similarly, the response of the absorption spectrum to an applied bias
due to the quantum confined Stark effect (QCSE) for a typical applied field of about 100 kV/cm
and for a total device thickness of approximately 1.2 µm (100 Å wells / 200 Å barriers) is also
shown (dashed) in Fig. 1. Again, experimental data were used to quantify the decrease in
oscillator strength as well as the exciton broadening effects that are typical for such systems. This
model illustrates the basis of operation of a normally-on amplitude modulator that utilizes
electroabsorption at the chosen wavelength (0.985 µm) to generate the off-state, resulting in a
maximum achievable change in absorption of 5500 cm^-1. In addition to this constraint, other
parameters that determine optimum performance include the device thickness and the mirror
reflectivities, as well as the width and location of the Fabry-Perot resonance modes (which are also
dependent on electrorefractive-induced perturbations in shape and position). Related to the device
thickness is a critical cavity phasing condition, which can be set to an odd or even multiple of π,
thus giving rise to two fundamentally different operational modes.
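
The zero-field part of the oscillator model described above can be sketched in a few lines. This is only an illustration under stated assumptions: the peak absorption, linewidth, and readout wavelength are taken from the text, while the exciton center energy `EXCITON_EV` is a hypothetical value chosen for the example, and the field-induced shift, broadening, and loss of oscillator strength (which the authors took from experimental data) are not modeled.

```python
import math

HC_EV_UM = 1.239842  # photon energy (eV) multiplied by vacuum wavelength (um)

def photon_energy_ev(wavelength_um):
    """Photon energy in eV for a vacuum wavelength given in micrometres."""
    return HC_EV_UM / wavelength_um

def lorentzian_absorption(energy_ev, center_ev, peak_cm=1.0e4, fwhm_ev=7.5e-3):
    """Lorentzian absorption coefficient (cm^-1) using the peak value and
    full width at half maximum quoted in the text."""
    half_width = fwhm_ev / 2.0
    detuning = energy_ev - center_ev
    return peak_cm * half_width**2 / (detuning**2 + half_width**2)

# Hypothetical zero-field exciton energy; the text does not state it.
EXCITON_EV = 1.270

readout_ev = photon_energy_ev(0.985)   # readout wavelength from the text
alpha_on = lorentzian_absorption(readout_ev, EXCITON_EV)
print(f"readout photon energy: {readout_ev:.4f} eV")
print(f"zero-field absorption at the readout wavelength: {alpha_on:.0f} cm^-1")
```

With a measured biased spectrum in hand, the same function evaluated with shifted and broadened parameters would give the off-state absorption, and the difference between the two states is the quantity the text bounds at 5500 cm^-1.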

For the transmission modulator configuration, application of the oscillator model with its
corresponding index dispersion data as shown in Figs. 1 and 2, in conjunction with an optimally
designed asymmetric cavity, yields a combination of fundamental and technological considerations
that in turn limit the dynamic range to 50% with a contrast ratio of 10:1, as shown in Fig. 3. In
this case, the mirror reflectivities are both equal to 85%. It should be noted that the signal
dependent phase modulation is limited in this case to approximately 0.03π. The fundamental
reason for the relatively poor contrast ratio is the necessity in the transmission configuration to
phase the cavity such that in the on (low absorption) state, the first and second order transmitted
components add in phase. Thus, extinction in the off state relies on the development of enough
voltage-induced absorption change at the operating wavelength to completely eliminate the first
order beam on a single pass through the modulator. The peak absorption coefficient characteristic
of the InGaAs/GaAs system at this composition is insufficient to produce more than a 10:1
change with a reasonably wide dynamic range. It should be noted in Fig. 3 that the shorter
wavelength results are somewhat artificial due to neglect of the parabolic, background and residual
absorption terms.

On the other hand, by utilizing a low reflectivity (25%) incident mirror in conjunction with
a high reflectivity back mirror, the best possible dynamic range achievable in reflection is 70% with
a contrast ratio of greater than 25:1, as shown in Fig. 4. This configuration is characterized by an
operational bandwidth of about 50 Å and an insertion loss of approximately 1.5 dB, without taking
into account the system throughput loss necessitated by the incident beamsplitter. If a polarization-
independent beamsplitter is employed, the system throughput loss in this case is about 8 dB with a
reduced dynamic range of 17.5% at the same contrast ratio.
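
The benefit of pairing a low reflectivity incident mirror with a high reflectivity back mirror can be illustrated with the standard textbook expression for the on-resonance reflectivity of a lossy Fabry-Perot cavity, in which the directly reflected beam and the beam returning from the cavity are in antiphase. The sketch below is not the authors' design calculation: it ignores electrorefraction and any detuning from resonance, and the back mirror reflectivity, the residual on-state absorption, and the treatment of the full 1.2 µm thickness as absorbing are all assumed values.

```python
import math

def afp_reflectivity_on_resonance(r_front, r_back, alpha_cm, thickness_cm):
    """Power reflectivity of an asymmetric Fabry-Perot cavity at resonance.
    r_front and r_back are mirror power reflectivities; alpha_cm is the
    absorption coefficient of the cavity medium in cm^-1."""
    # The field amplitude is attenuated by exp(-alpha * d) per round trip.
    round_trip = math.exp(-alpha_cm * thickness_cm)
    numerator = math.sqrt(r_front) - math.sqrt(r_back) * round_trip
    denominator = 1.0 - math.sqrt(r_front * r_back) * round_trip
    return (numerator / denominator) ** 2

R_FRONT = 0.25           # incident mirror reflectivity, from the text
R_BACK = 0.95            # assumed high reflectivity back mirror
THICKNESS_CM = 1.2e-4    # 1.2 um, assumed to be entirely absorbing material
ALPHA_ON = 500.0         # assumed residual on-state absorption, cm^-1
ALPHA_OFF = ALPHA_ON + 5500.0   # adds the maximum absorption change from the text

r_on = afp_reflectivity_on_resonance(R_FRONT, R_BACK, ALPHA_ON, THICKNESS_CM)
r_off = afp_reflectivity_on_resonance(R_FRONT, R_BACK, ALPHA_OFF, THICKNESS_CM)
print(f"on-state reflectivity:  {r_on:.3f}")
print(f"off-state reflectivity: {r_off:.4f}")
print(f"reflectivity change:    {r_on - r_off:.3f}")
```

In this simplified model the off-state reflectivity vanishes when sqrt(R_front) equals sqrt(R_back) multiplied by exp(-alpha d), which is why a modest absorption change can produce a high contrast ratio in reflection. The figures it prints should not be expected to match the values reported in the text.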

Should a reflection modulator configuration prove desirable, the transparency afforded by
the InGaAs/GaAs MQW system on a GaAs substrate can be used to advantage by inverting the
traditional device structure to incorporate the low reflectivity incident mirror as a few period Bragg
mirror (typically AlGaAs/AlAs) capped during growth by the strained layer MQW structure. The
device may then be completed by deposition of either a metal (affording both a contact as well as
RF isolation) or dielectric stack high reflectivity mirror on the top surface. The principal advantage
of this structural configuration is the elimination of the requirement for a (typically) 20 or more
period Bragg reflector grown prior to the precision growth stage necessitated by the desired
thickness of active MQW material. Not only does this significantly affect the device processing
throughput and yield, but it also greatly enhances the quality of the final interface prior to the
initiation of MQW growth.

Additional technological considerations affecting the implementation of SLMs with
InGaAs/GaAs MQW structures include issues of growth thickness accuracy (such as process
variability and array uniformity), which particularly affect two critical design parameters: namely,
the oscillator position and linewidth, and the cavity resonance condition. Background doping
concentrations are also known to be particularly important in maximizing the contrast ratio, as well
as the voltage division between any incorporated Bragg mirror (with > k_B T band discontinuities)
and the MQW active region.

Introduction

Optical memory disc technology has recently matured and penetrated the personal computer
industry market with high density and extremely high capacity serial data storage devices. The
high resolution (submicron) and long life time (> 10 years) characteristic of the optical disc
medium in combination with image-based recording and an appropriate parallel readout scheme
also make this advanced technology a very competitive alternative for spatial light modulation.

Utilization  Concept 

There are numerous possible ways of utilizing the optical memory disc as a spatial light
modulator (SLM). One such configuration is as a binary (half-tone) memory with a parallel readout
capacity of up to 10^8 bits/cm^2. This use, of course, implies strict design constraints on the
resolution of the readout optics in order to fully utilize this capacity. More traditional SLM
functions can be incorporated by using the optical disc as a binary-encoded analog image
memory in which grey scale is obtained at the expense of resolution. Assume, for simplicity, a
bit density of 10^8 bits/cm^2 (which is within reach of current optical disc technology) and an image
size of 1 cm x 1 cm. The image can then be divided into 1000 x 1000 pixels, each of size 10 µm x
10 µm with 10 x 10 bits for grey scale generation. By random area encoding the serially written
bits in each pixel, a particular grey scale level directly proportional to the total number of written
bits in each pixel can be obtained. The theoretical dynamic range for this case is 100:1, corresponding to
approximately 6.6 grey scale bits, thus resulting in a total information density of ≈ 6.6 Mbits/cm^2.
Note that it is now only necessary to resolve each pixel (10 µm x 10 µm) in order to obtain the
desired continuous grey level in each pixel.
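
The trade between resolution and grey scale in this example is simple arithmetic, sketched below. The function and key names are illustrative assumptions; the numbers passed in are those of the example in the text.

```python
import math

def grey_scale_budget(bit_density_per_cm2, image_side_cm, pixels_per_side):
    """Partition a binary disc area into grey-scale pixels."""
    total_bits = bit_density_per_cm2 * image_side_cm ** 2
    pixel_count = pixels_per_side ** 2
    bits_per_pixel = total_bits / pixel_count          # levels available per pixel
    grey_bits = math.log2(bits_per_pixel)              # equivalent grey scale bits
    return {
        "pixel_size_um": image_side_cm * 1.0e4 / pixels_per_side,
        "bits_per_pixel": bits_per_pixel,
        "grey_scale_bits": grey_bits,
        "information_density_bits_per_cm2": pixel_count * grey_bits / image_side_cm ** 2,
    }

budget = grey_scale_budget(1.0e8, 1.0, 1000)
for name, value in budget.items():
    print(f"{name}: {value:.4g}")
```

Choosing fewer, larger pixels raises the number of grey levels per pixel logarithmically while lowering the pixel count quadratically, so the information density in grey scale bits falls well below the raw binary density of the disc.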


Optical  Readout  Approach 

In order to implement parallel (two-dimensional) readout of optical memory discs, it is
desirable to employ direct imaging of the optical disc to obtain the maximum achievable throughput
efficiency and at the same time obtain a high contrast ratio in the output image. An alternative
approach is to read out the optical disc in parallel holographically by storing the images in the
form of two-dimensional computer generated holograms. However, holographic readout implies a
non-optimum trade-off between throughput efficiency and SNR for a given space-bandwidth
product. An optical readout configuration which has an inherently high SNR, is independent of
the groove profile (required for the tracking servo-system of most disc drivers), and can be utilized
with a variety of recording layer principles (such as ablation or bump forming) can be achieved by
using a differential (or shearing) interferometric approach. In this readout scheme, two spatially
coherent images of the optical disc are created and interfered differentially by introducing a shift of
Λ/2, one half of the minimum bit pitch Λ, along the direction of the grooves (Fig. 1a). In the
output image only local differences in phase (height) and/or reflectivity of the recording layer will
be detected. Thus each bit will produce two bright marks (Λ/2 apart) on a dark background in the
output image (Fig. 1b). Maximum output intensity will consequently be obtained for recording
with a 50% duty cycle pattern.


Optical  Readout  Configuration 

The differential interferometric readout concept can be implemented by using a shear plate,
which consists of a birefringent plate with a tilted optic axis (Fig. 2). When imaging the optical
disc through the shear plate, two images with orthogonal polarizations are produced and are
sheared by an appropriate amount as determined by the recorded bit length. In order to obtain the optimum
phase shift of π ± 2nπ (n is an integer) between the two images, the optical path length difference
between the two orthogonal polarizations can be set by tilting the shear plate, or pre-adjusted by a
variable compensator. The output analyzer produces the desired interference between the two
orthogonally polarized images. In practice, the shear plate tilt (and/or compensator) and the
analyzer are tuned for maximum extinction of the reflected light from an unwritten area on the disc
in order to obtain the highest possible contrast ratio in the output image.

Design  Considerations 

The optical readout configuration shown in Fig. 2 is very flexible in design. The crucial
element, the shear plate, can be manufactured in several ways. For example, a uniaxial crystal plate
with a tilted optic axis can be used as shown in Fig. 3. The shear, S, of such a plate is given by

$$S = d\,\frac{(n_e^2 - n_o^2)\tan\theta}{n_e^2 + n_o^2\tan^2\theta}$$

in which d is the width of the plate, θ is the optic axis tilt angle, and n_o and n_e are the ordinary and
extraordinary indices of refraction of the uniaxial crystal, respectively. Since S is to first order
directly proportional to Δn = n_e - n_o, and since the shear required (µm range) is << d, it is desirable
to choose a uniaxial crystal with a small birefringence. Quartz, for example, has an index
difference Δn ≈ 0.009 (at λ = 632.8 nm), and in addition exhibits good polishing characteristics.
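
To give a feel for the plate dimensions involved, the sketch below evaluates the standard double-refraction walk-off expression for a uniaxial plate at normal incidence, S = d (n_e^2 - n_o^2) tan θ / (n_e^2 + n_o^2 tan^2 θ), with θ the angle between the plate normal and the optic axis. The quartz indices are assumed handbook-style values consistent with the Δn ≈ 0.009 quoted in the text, and the 45° tilt is a hypothetical choice for illustration, not a value from the original work.

```python
import math

def shear_per_unit_thickness(n_o, n_e, tilt_deg):
    """Lateral shear divided by plate thickness for a uniaxial plate whose
    optic axis makes an angle tilt_deg with the plate normal."""
    t = math.tan(math.radians(tilt_deg))
    return (n_e**2 - n_o**2) * t / (n_e**2 + n_o**2 * t**2)

def plate_thickness_for_shear(shear_um, n_o, n_e, tilt_deg):
    """Plate thickness (um) needed to produce a given shear (um)."""
    return shear_um / shear_per_unit_thickness(n_o, n_e, tilt_deg)

N_O, N_E = 1.5426, 1.5517   # assumed quartz indices near 632.8 nm
TILT_DEG = 45.0             # hypothetical optic axis tilt

thickness_um = plate_thickness_for_shear(5.0, N_O, N_E, TILT_DEG)
print(f"shear per unit thickness: {shear_per_unit_thickness(N_O, N_E, TILT_DEG):.5f}")
print(f"plate thickness for a 5 um shear: {thickness_um:.0f} um")
```

Under these assumptions a micrometre-scale shear corresponds to a plate somewhat under a millimetre thick, which illustrates why a small birefringence is convenient: the plate remains thick enough to polish and handle.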

SLM  Output  Characteristics 

As test patterns, three bands were written on two different types of discs: write-once
ablative media discs (amorphous alloy) of the dark mark type, and erasable bump forming media
(dye polymer). Each band was approximately 0.5 mm in width and written at a duty cycle
corresponding to a specific grey scale level. The three duty cycles used were 50% (maximum
output intensity), 25% and 12.5%, all written using 5 µm long bits. Using the differential
interferometric readout configuration with a 5 µm shear, the three bands for both media were
clearly extracted with the appropriate grey scale level factor of two between adjacent bands. The
measured output intensity normalized to the highest grey scale level (the 50% duty cycle band) as a
function of duty cycle for both media is shown in Fig. 4. The SLM output characteristic is seen to
be very linear for both media. However, the currently obtained contrast ratio is much higher for
the bump forming medium (≈ 20:1) than for the ablative medium (≈ 5:1). The main reason for the
lower contrast ratio for the ablative medium is interference from the other (transmissive)
recording layer on the two-sided optical disc. The ablative recording layer is furthermore read out
through a non-AR coated polycarbonate substrate (1.2 mm thick) that may exhibit some residual
birefringence. The bump forming medium, on the other hand, has a non-transmissive air incident
recording layer. The contrast ratio of the bump forming medium could be further increased by
using high quality (glass) substrates and higher uniformity recording layers with fewer
(depolarizing) surface blemishes.
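
The factor of two between adjacent bands can be reproduced with an idealized one-dimensional model of the differential readout: the amplitude pattern along a groove is interfered with a copy of itself that is shifted by the shear distance and π out of phase. This is a sketch under simplifying assumptions (one sample per micrometre, ideal dark marks with zero reflectivity, perfect extinction, each image carrying half the amplitude); the function names are illustrative.

```python
def band_pattern(bit_len, pitch, periods, written=0.0, unwritten=1.0):
    """Amplitude reflectivity along the groove, one sample per micrometre."""
    one_period = [written] * bit_len + [unwritten] * (pitch - bit_len)
    return one_period * periods

def differential_readout(pattern, shear):
    """Mean output intensity when the pattern is interfered with a copy of
    itself shifted by `shear` samples and pi out of phase."""
    n = len(pattern)
    total = 0.0
    for i in range(n):
        # Each of the two sheared images carries half of the amplitude.
        diff = 0.5 * (pattern[i] - pattern[(i - shear) % n])
        total += abs(diff) ** 2
    return total / n

# 5 um bits read with a 5 um shear; the pitch sets the duty cycle.
levels = {
    duty: differential_readout(band_pattern(5, pitch, 8), 5)
    for duty, pitch in ((0.5, 10), (0.25, 20), (0.125, 40))
}
for duty, level in levels.items():
    print(f"duty cycle {duty:.3f}: mean output intensity {level:.4f}")
```

In this model the mean output is strictly proportional to duty cycle, in line with the linear characteristic reported in Fig. 4. Passing `written=-1.0` models an ideal phase bump instead of a dark mark and raises the 50% duty cycle output by a factor of four.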

Differential  Interferometric  Readout  Performance 

The maximum throughput efficiency for an optical disc medium in an otherwise ideal
differential interferometric readout configuration is given by

$$\eta_{medium} = \frac{1}{4}\left[\,|r_0|^2 + |r_1|^2 - 2\,|r_0|\,|r_1|\cos\!\left(2\pi n\cdot\frac{2h}{\lambda} + \arg\{r_0\} - \arg\{r_1\}\right)\right]$$

in which r_0 and r_1 are the (complex valued) amplitude reflectivities of the (unwritten) recording
layer and the written bit respectively, h is the height of the written bit and n is the refractive index
of the optical disc substrate (for substrate incident media). The highest obtainable efficiency is
achieved for an (ideal) bump forming medium (shown in Fig. 5) for which r_0 = r_1 and h =
±λ/4n. That is, the efficiency is directly proportional to |r_0|^2 with a unity constant (the efficiency
factor is unity). However, for an ideal dark mark ablative medium (r_1 = 0 and h = 0), the
efficiency is directly proportional to |r_0|^2/4; that is, the efficiency factor is 1/4. Most ablative media
in fact comprise amorphous alloy recording layers with complex valued indices of refraction
characterized by a non zero bit reflectivity (r_1), which may in fact boost the effective efficiency
factor beyond 1/4. For our experimental readout configuration, the total measured throughput
efficiency (relative to the collimated laser beam) of the SLM for the two media was 0.9% for the
bump forming medium and 3.0% for the ablative medium. These measurements however
included the losses in the readout optics (polarizing beam splitter, shear plate and imaging lenses)
for which the throughput efficiency was independently measured to be 0.85. The remaining losses
can be divided into two separate parameters: losses due to the optical disc medium (limited
recording layer reflectivity, air-substrate interface reflection loss and diffraction from grooved
substrates) and loss due to the data pattern itself, which involves the shape and size of the written
bits (efficiency factor, fill factor of bits, and diffraction losses due to limited numerical aperture).
The optical disc medium losses are mainly determined by the reflectivity of the recording layer,
which was 40% for the ablative medium and 13% for the bump forming medium. In another
experiment, the bump forming medium was also coated with a 30 nm aluminum layer, which
boosted the reflectivity to 86% and the total throughput efficiency to 5.8%. The total data pattern
efficiencies were approximately the same for both media (0.11 for the ablative and 0.08 for the
bump forming). The relatively low data pattern efficiency for the bump forming medium is due to
the non-ideal shape of the bumps and a slightly lower fill factor of ≈ 0.40 versus 0.63 for the
ablative medium. By increasing the fill factor and the numerical aperture of the readout optics, and
by optimizing the AR-coatings, the (complex valued) recording layer reflectivity (e.g. by using
separate write and parallel read wavelengths) and the bump shape (if applicable), the total
throughput efficiency can be substantially increased.
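
The two limiting cases of the efficiency expression, and the way the quoted loss factors combine for the air incident bump forming medium, can be checked numerically. The sketch assumes the expression eta = (1/4)[|r0|^2 + |r1|^2 - 2|r0||r1| cos(2πn·2h/λ + arg r0 - arg r1)] and treats the total throughput as a simple product of the optics efficiency, the recording layer reflectivity, and the data pattern efficiency. The amplitude reflectivity `R0` and the readout wavelength are illustrative assumptions, and the function names are hypothetical.

```python
import cmath
import math

def medium_efficiency(r0, r1, height, wavelength, n=1.0):
    """Maximum throughput efficiency of the medium in differential readout.
    r0, r1: complex amplitude reflectivities of the unwritten layer and the
    written bit. height and wavelength share one length unit; n is the index
    of the medium in front of the recording layer (1.0 for air incident)."""
    phase = (2.0 * math.pi * n * 2.0 * height / wavelength
             + cmath.phase(r0) - cmath.phase(r1))
    return 0.25 * (abs(r0) ** 2 + abs(r1) ** 2
                   - 2.0 * abs(r0) * abs(r1) * math.cos(phase))

def total_throughput(optics, layer_reflectivity, data_pattern):
    """Simplified loss budget: product of three independent efficiencies."""
    return optics * layer_reflectivity * data_pattern

R0 = 0.6              # hypothetical amplitude reflectivity, illustration only
WAVELENGTH = 0.6328   # um, assumed readout wavelength

bump = medium_efficiency(R0, R0, WAVELENGTH / 4.0, WAVELENGTH)  # h = lambda/4n
dark = medium_efficiency(R0, 0.0, 0.0, WAVELENGTH)              # ideal dark mark
print(f"ideal bump: {bump:.3f} (|r0|^2 = {R0**2:.3f})")
print(f"ideal dark mark: {dark:.3f} (|r0|^2 / 4 = {R0**2 / 4:.3f})")

# Values quoted in the text for the bump forming medium.
bump_total = total_throughput(0.85, 0.13, 0.08)
bump_al_total = total_throughput(0.85, 0.86, 0.08)
print(f"bump forming: {bump_total:.2%}, aluminum coated: {bump_al_total:.2%}")
```

The simple product reproduces the measured totals for the bump forming medium to within rounding. For the substrate incident ablative medium the same product (0.85 x 0.40 x 0.11) comes out somewhat above the measured 3.0%, presumably because the interface reflection and groove diffraction losses named in the text are not captured by the recording layer reflectivity alone.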

Figure 3. Shear plate.


Figure 4a. SLM output characteristic.


Figure 4b. SLM output characteristic.

Figure 5. Readout geometry of a bump forming optical disc medium.


